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Preface 


With many experimental undertakings around the corner such as the Large 
Hadron Collider, with many neutrino experiments already in the pipeline or 
under way and with aspirations for perhaps an International Linear Collider 
in not too distant future, the field of Particle Physics is poised to take a 
giant leap into unravelling the unknown world of new particles and forces 
in the coming decades and build on its success of the past four decades. 
Combined with the spectacular developments in the field of cosmology, 
which has perhaps already given us the standard model of the universe and 
begging for new ideas from particle theory for a deeper understanding of 
observations, the promise of major breakthroughs and deep insights have 
filled the air. Many exciting ideas such as supersymmetry, extra dimensions 
and grand unification are reaching a stage of maturity waiting to be tested. 
We may also learn about the true nature of the dark constituent of the 
universe, as well as about the happenings at the early moments of the Big 
Bang embodied in the ideas of inflation. These discoveries may also provide 
a better understanding of the formation of structure and evolution of stars 
and galaxies. 

In order to prepare for this new era, the TASI summer school has always 
been structured to bring to Ph. D. students in the US and abroad the 
latest ideas and information in a cogent and pedagogical manner, so as to 
build the intellectual base for tackling the new theoretical challenges that 
will emerge and are already emerging. The 2006 TASI school was charged 
with bringing the new phenomenological, cosmological and model building 
frontier to the students and researchers of tomorrow. With this in mind, 
we decided to focus on two main themes: Colliders and Neutrinos at the 
frontier of Physics and inviting experts in the related fields to lecture at 
the school. 
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Chapter 1 


Extra Dimensions 


Kaustubh Agashe 


Department of Physics, Syracuse University, 
Syracuse, NY 18244, USA* 


We begin with a discussion of a model with a flat extra dimension which 
addresses the flavor hierarchy of the Standard Model (SM) using pro- 
files for the SM fermions in the extra dimension. We then show how 
flavor violation and contributions to the electroweak precision tests can 
be suppressed [even with O(TeV) mass scale for the new particles] in 
this framework by suitable modifications to the basic model. Finally, we 
briefly discuss a model with a warped extra dimension in which all the 
SM fields propagate and we sketch how this model “mimics” the earlier 
model in a flat extra dimension. In this process, we outline a “com- 
plete” model addressing the Planck-weak as well as the flavor hierarchy 
problems of the SM. 


1.1. Introduction 


Extra dimensions is a vast subject so that it is difficult to give a complete 
review in 5 lectures. The reader is referred to excellent lectures on this 
subject already available such as references [1—4] among others. Similarly, 
the list of references given here is incomplete and the reader is referred to 
the other lectures for more references. 

We begin with some (no doubt this is an incomplete list) motivations 
for studying models with extra dimensions: 


(i) Extra dimensional models can address or solve many of the prob- 
lems of the Standard Model (SM): for example, the various hi- 


* After August 1, 2007: 

Maryland Center for Fundamental Physics, 
Department of Physics, 

University of Maryland, 

College Park, MD 20742, USA 
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erarchies unexplained in the SM — that between the Planck and 
electroweak scales [often called the “(big) hierarchy problem”] and 
also among the quark and lepton masses and mixing angles (often 
called the flavor hierarchy). We will show how both these problems 
are solved using extra dimensions in these lectures. 

Extra dimensional models can also provide particle physics candi- 
dates for the dark matter of the universe (such a particle is absent 
in the SM). We will not address this point in these lectures. 

(ii) Extra dimensions seem to occur in (and in fact are a necessary 
ingredient of) String Theory, the only known, complete theory of 
quantum gravity (see K. Dienes’ lectures at this and earlier summer 
schools). 

(iii) Although we will not refer to this point again, it turns out [5] 
that, under certain circumstances, extra dimensional theories can 
be a (weakly coupled) “dual” description of strongly coupled four- 
dimensional (4D) theories as per the correspondence between 5D 
anti-de Sitter (AdS) spaces and 4D conformal field theories (CFT’s) 
[6]. 


The goal of these lectures is a discussion of the theory and phenomenol- 
ogy of some types of extra dimensional models, especially their applications 
to solving some of the problems of the SM of particle physics. The main 
concept to be gleaned from these lectures is that 


e extra dimensions appear as a tower of particles (or modes) from 
the 4D point of view (a la the standard problem of a particle in 
1D box studied in quantum mechanics). 


The lightest mode (which is often massless and hence is called the zero- 
mode) is identified with the observed or the SM particles. Whereas, the 
heavier ones are called Kaluza-Klein (KK) modes and appear as new par- 
ticles (beyond the SM). It is these particles which play a crucial role in 
solving problems of the SM, for example they could be candidates for dark 
matter of the universe or these particles can cut-off the quadratically diver- 
gent quantum corrections to the Higgs mass. These particles also give rise 
to a variety of signals in high-energy collider (i.e., via their on-shell or real 
production) and in low-energy experiments (via their off-shell or virtual 
effects). This is especially true if the masses of these KK modes are around 
the TeV scale, as would be the case if the extra dimension is relevant to 
explaining the Planck-weak hierarchy. 
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Here is a rough outline of the lectures. In lecture 1, we begin with 
the basics of KK decomposition in flat spacetime with one extra dimen- 
sion compactified on a circle. We will show how obtaining chiral fermions 
requires an orbifold compactification instead of a circle. In lecture 2, we 
will consider a simple solution to the flavor hierarchy using the profiles of 
the SM fermions in the extra dimension. However, we will see that such a 
scenario results in too large contributions to flavor changing neutral current 
(FCNC) processes (which are ruled out by experimental data) if the KK 
scale is around the TeV scale — this is often called a flavor problem. Then, 
in lecture 3, we will consider a solution to this flavor problem based on the 
idea of large kinetic terms (for 5D fields) localized on a “brane”. Another 
kind of measurement of properties of the SM particles (not involving flavor 
violation), called Electroweak Precision Tests, will be also be studied in this 
lecture, including the problem of large contributions to one such observable 
called the T (or p) parameter. In lecture 4, we will solve this problem of 
the T parameter by implementing a “custodial isospin” symmetry in the 
extra dimension. We will then briefly discuss some collider phenomenology 
of such models and some questions which are unanswered in these models. 
Finally, we will briefly study models based on warped spacetime in lecture 
5, indicating how such models “mimic” the models in flat spacetime (with 
large brane kinetic terms) studied in the previous lectures. We will sketch 
how some of the open questions mentioned in lecture 4 can be addressed in 
the warped setting, resulting in a “complete” model. 


1.2. Lecture 1 


1.2.1. Basics of Kaluza-Klein Decomposition 


Consider the following 5D action for a (real) scalar field (here and hence- 
forth, the coordinates z” will denote the usual 4D and the coordinate y 
will denote the extra dimension): 


Spe [ate [aul (oe) (Oy ®) — Moo (1.1) 


Since gravitational law falls off as 1/r? and not 1/r? at long dis- 
tances, it is clear that we must compactify the extra dimension. Sup- 
pose we compactify the extra dimension on a circle (S1), i.e., with y un- 
restricted (—oo < y < oo), but with y identified with y + 27R.* We 


*Equivalently, we can restrict the range of y: 0 < y < 27R, imposing the condition that 
= 0 same as y = 27R. 
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impose periodic boundary conditions on the fields as well, i.e., we require 
®(y = 2r R) = (y). Then, we can (Fourier) expand the 5D scalar field as 
follows: 

1 n=+oo 


SO o™(aeinv/# (1.2) 


where the coefficient in front has been chosen for proper normalization. 
Substituting this expansion into Ssp and using the orthonormality of 
profiles of the Fourier modes in the extra dimension (i.e., e’”Y/”) to inte- 
grate over the extra dimension, we obtain the following 4D action: 
2 


Sap = / dY | (a0) (ae) - ( Mee z) og] 
(1.3) 


This implies that from the 4D point of view the 5D scalar field ap- 
pears as an (infinite) tower of 4D fields which are called the Kaluza-Klein 
(KK) modes: ¢‘ with mass?, m2 = M? + n?/R? (note that the n?/R? 
contribution to the KK masses arises from 05 acting on the profiles) [see 
Fig. 1.1(a)]. 

The lightest or zero-mode (n = 0) has mass M (strictly speaking it 
is massless only for M = 0). The non-zero KK modes start at ~ 1/R 
(for the case M « 1/R) which is often called the compactification scale. 
We can easily generalize to the case of 6 extra dimensions, each of which 


(a) St (b) S*/22 
MaD , n>0 n<0 cos (7#) sin (4#) 
Wear | 
M?+1/R? | 
M 4 Á — 
0 


Fig. 1.1. KK decomposition of a 5D scalar on a circle (a) and an orbifold (b), choosing 
even parity. 
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(1) Sy oN hte ne 
y=0 y=TR 


Fig. 1.2. Going from a circle to an orbifold using Z2 symmetry 


is compactified on a circle of same radius to obtain the spectrum: m2? = 


M? + ~_, n?/R2. However, in these lectures, we will restrict to only one 
extra dimension. 

Thus, we see that the signature of an extra dimension from the 4D 
point of view is the appearance of infinite tower of KK modes: to repeat, 
the lightest (zero)-modes is identified with the SM particle and the heavier 
ones (KK modes) appear as new particles beyond the SM. 


1.2.2. Orbifold 


Mathematically speaking, a circle is a (smooth) manifold since it has no 
special points. We can “mod out” this smooth manifold by a discrete sym- 
metry to obtain an “orbifold”. Specifically, we impose the discrete (Z2) 
identification: y > —y in addition to y = y + 2r R. Thus, the physical or 
fundamental domain extends only from y = 0 to y = TRP — this compacti- 
fication is denoted by S1/Z: see Fig. 1.2. 

The endpoints of the orbifold (y = 0,7R) do not transform under Z2 
and hence are called fixed points of the orbifold. Also, note that the end 
points of this extra dimension are not identified with each other either by 
the periodicity condition y = y + 27R (unlike the endpoints y = 0,27R on 
S1) or by the Z symmetry. 

Let us consider how the KK decomposition is modified in going from a 


circle to an orbifold. We can rewrite the earlier KK decomposition in terms 
of functions which are even and odd under y > —y: 


(x,y) = R” pe TR |e” cos 2 


T 
+ o_" sin w) (1.4) 


with the identification ¢Ẹ£7® = 1(1)/ V2 (™ + 4). 


bEquivalently, we can still pretend that it extends from y = 0 to y = 27R as before, 
but with the region y = 7R to y = 2rR not being independent of the region y = 0 to 
y =TR. 
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We must require the physics, i.e., S5p, to be invariant under y > —y. 
For this purpose, we assign an (intrinsic) parity transformation to ®: 


(x, —y) = P(x,y) (1.5) 


(m>0) S 


with P = 1, i.e., ® being even or odd. This assignment sets ¢ 
for P = +1 and ¢Ẹ” = 0 [including ¢®] for P = —1 see Fig. 1.1(b). 
Thus, a summary of orbifold compactification is that®: (i) it reduces the 
number of modes by a factor of 2 and (ii) it removes or projects out the 
zero-mode for the case of the 5D field being odd under the parity. 


1.2.3. Fermions on a Circle: Chirality Problem 


One possible representation of the 5D Clifford algebra for fermions: 
{T m, TN} = 2nun (1.6) 


is provided by the usual Dirac (4 x 4) matrices 


Pa = Yu, T; = —iys (1.7) 


Thus, we see that the smallest (irreducible) representation for 5D fermions 
has 4 (complex) components (cf. 2-component complex or Weyl spinor 
in 4D, where the 2 x 2 Pauli matrices form a representation of Clifford 
algebra). 

Consider the following 5D action for fermions 


Ssp = WV (iðmT” — M) Y (1.8) 


When the extra dimension is compactified on a circle, we can plug in the 
usual decomposition Ya=1-4 = >, Yh) einy/R to find the 4D action: 


Sap = Sb) (iy ð” — M — in/R) py (1.9) 


n 


Thus, we obtain a tower of Dirac (4-component) spinors from the 4D point 
of view: m? = M? +n?/R?: see Fig. 1.3(a). 


We will see later how an orbifold is “useful” in the case of 5D fermion/gauge fields 
because of these properties. 


Extra Dimensions T 


MaD , (a) S? (b) St/22 
Ur Ur Wy (even) Wr (odd) 

n>0 n<0 n>0 n<0 cos (4%) sin (4#) cos (4%) sin (4) 
2 
R — Å= =a =z 
EN 
R — — samme — 
0 L m --- e 254 E mmm NEEN fl EA 


Fig. 1.3. KK decomposition for a 5D fermion on a circle (a) and an orbifold (b) with 
even parity for Yz. 


Consider the case M = 0. We see that there are non-chiral massless (or 
zero) modes: explicitly, in the Weyl representation of Dirac matrices, i.e., 


ee (1.10) 


Op O 

1 0 
= 1.11 
w= (5-4) (1.41) 
on = (Ti=1..3, 1), (1.12) 


pia decomposes as ~ | Ola z 1,2), 0 (a = 3,4), where L (R) 
refers to left (right) chirality (or helicity) under the 4D Lorentz transfor- 
mation. The problem is that if the 5D fermion transforms under some 
5D gauge symmetry, then the L and R (massless) chiralities (zero-modes) 
transform identically under this gauge symmetry. Hence, such a scenario 
cannot correspond to the SM, where the fermions are known to be chi- 
ral, i.e., the left-handed (LH) and right-handed (RH) ones transform as 
doublets and singlets, respectively under the SU(2) wear gauge symmetry. 


1.2.4. Fermion Chirality from Orbifold 


We can obtain chiral fermions by compactifying the 5D theory on an orb- 
ifold instead of a circle as follows. Suppose we choose W; to be even under 
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the Z2 parity. Then, Up must be odd since the 5D action contains the 
term WI; > Vtr, which must be even so that the 5D action is 
Zə-invariant (note that ôs is odd under parity). 
We obtain the following decomposition: 
p ny,. n 
UL (R) oe. ie cos —= r (sin >) (1.13) 
Thus, (for case of the 5D mass, M = 0°) we get a massless zero-mode only 
for Uz (even field): see Fig. 1.3(b). Of course, we could have chosen Vz to 
be even instead to obtain a RH zero-mode. 


1.3. Lecture 2 


1.3.1. Zero-Mode Fermion Profiles 


We see that the massless (chiral) mode on an orbifold has a flat profile 
[see Eq. (1.13)]. So, if all the SM fermions have M = 0, then the extra 
dimension does not provide any resolution of the flavor hierarchy, i.e., we 
need to put hierarchies in 5D Yukawa couplings (similar to the situation in 
the SM) in order to obtain hierarchies in the 4D Yukawa couplings. 

We must then consider modifying the profiles of the fermion zero-modes 
in order to solve the flavor hierarchy problem using the extra dimension. 
We can try adding a bare mass term: VV = viR + h.c., but such a mass 
term breaks the Z2 symmetry (again since Y z£, g transform oppositely under 
the parity) . The solution to this problem [7] is to couple the 5D fermion 
to a Z2-odd scalar with the following 5D Lagrangian: 


Lsp = WV (iðmT™ — hð) Y 
+ (8m) -A (©? - V?) (1.14) 


The point is that the potential V(®) = \(®? — vey forces a vacuum 
expectation value (vev) for ® which is a constant in y in-between the end- 
points of the extra dimension (often called the “bulk”). However, such a 
vev tends to “clash” with ® = 0 at the endpoints (as required by the scalar 
being odd under the Zə parity). As a result, we obtain a (approximately) 
“kink-anti-kink” profile for the scalar vev (see references [7] for more de- 
tails) as in Fig. 1.4. Such a profile for the scalar vev is equivalent to adding 
a Z2-odd 5D mass for the fermion. The point is that with such a scalar vev 


dWe will see in the next section that only a “special” form of mass term is allowed on 
an orbifold. 
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profile 


> Viens | 


Fig. 1.4. Profile of odd mass term (dashed line) and fermion zero-mode (solid line). 
Here and henceforth, we set radius of extra dimension, R = 1 in all figures. 


we have a spontaneous breaking of the Z2 symmetry — recall that it is this 
Z2 symmetry which prevented us from writing such a mass term to begin 
with, i.e., a bare mass term would correspond to an explicit breaking of this 
symmetry. 

Let us then consider how the KK decomposition is modified in the pres- 
ence of such an (odd) bulk fermion mass term. The 5D action is 


S55 T fiðuT™ + Mely) Y (1.15) 


where e(y) = +1(—1) for tR > y > O(-7R < y < 0). It is easy to see 
that the eigenmodes are no longer single sin or cos, but instead are linear 
combinations of these basis functions. Hence, we have to work harder to 
obtain the eigenmodes. 


1.3.2. General Procedure for KK Reduction 


We will now take a slight detour to discuss the procedure to obtain the KK 
decomposition for a general 5D action and return to apply this procedure 
to the above 5D fermion case. 

For simplicity, consider a 5D scalar field decomposed into modes as 
follows: ®(x,y) = $, 6™(x)fn(y). Plug this expansion into the simple 
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5D action: 
Cee [te f a| (02) (Ou) — VEZ (1.16) 


We require that, after integrating over the extra dimension, we get 
we 
Dre / da Y | (3.6) (axe) - ( ewe 7) soe] 
(1.17) 


so that we can interpret ¢”)’s as particles (KK modes) from the 4D point 
of view. 

This requirement gives us the following two equations: matching kinetic 
terms in Sap of Eq. (1.17) to the 0, (or 4D) part of the kinetic term 
obtained from S5p gives us the following: 


(i) orthonormality condition 


/ dy f" (9) faly) = 1 (1.18) 


whereas matching the mass terms in S4p of Eq. (1.17) to the 5D mass 
term (M) and the action of 05 on the profiles in S5p gives us the 


(ii) differential equation: 


3y fay) — M? fry) = -mafa lY) (1.19) 


Thus the KK decomposition reduces to an eigenvalue problem, solving 
which gives us the KK masses (eigenvalues) mn and their profiles fn(y) 
(eigenfunctions). This is very reminiscent of solving the problem of 
Schroedinger equation for a particle in a 1D box in quantum mechanics. 

For the above simple case of a 5D scalar with a bulk mass, we get the 
following solutions to the differential equation [i.e., Eq. (1.19)]: fn(y) ~ 
etiv ma-M?y for m2 > M?. In addition, the periodicity condition, i.e., 
fr(y) = fnly + 2TR) requires \/m2 — M? = n? /R? so that m2 = M? + 
n? /R? (as before). The reader should think about the possibility m2 < M? 
(where we get exponentially rising or decaying profiles) to show that we 
cannot satisfy the continuity of derivative at y = 0,7R in this case and 
hence we cannot have such solutions for a scalar. 

The above procedure can be generalized to more complicated 5D actions 
and for other spin fields. 
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1.3.3. Solution to Flavor Puzzle 


Next, we return to the problem of the KK decomposition of a 5D fermion 
with the (odd) mass term and with Y; (p) being even (odd) under Z2 parity. 
As outlined above, we plug Yz r = Y™ (x) fL,R n(y) into Ssp to obtain the 
differential equations: 


| — ðs + Me(y)| fin = Mn fR (1.20) 
[ðs E Me(y)| fii = ate (1.21) 


Note that (as mentioned before) cos or sin are solutions only for M = 0, 
but not for M 4 0 [On a circle, the mass term M has no e(y) so that 
ÍL Ra ~ eY/R are indeed solutions.]. 

It is easy to solve for the zero-mode profile (m,, = 0) even for M 4 0 
(the m, Æ 0 case is difficult to solve due to the two differential equations 
being coupled): 


froly)=Ne™Y (0<y< rR) 
= Ne ™” (0 >y>-rR) (1.22) 


(N is a normalization factor: see exercise 1 in appendix). 

Note that for RH modes, fro ~ eF™Y solves the eigenvalue equation, 
but it clashes with vanishing of fr o(y) at y = 0, mR as required by Y pg being 
odd under Zə parity. Thus, as expected from the parity choice, there is no 
RH zero-mode. Note that there is a discontinuity in the derivative of fr o 
at y = 0,7R (Fig. 1.4), which precisely matches the e(y) term (cf. scalar 
case earlier where such profiles cannot satisfy the requirement of continuity 
of derivative at the fixed points). The point is that M # 0 still gives a 
massless fermion mode (unlike for a scalar). 

We will now see how the flavor hierarchy can be accounted for without 
any large hierarchies in the 5D theory: see exercise 1 and Fig. 1.5. For 
simplicity, suppose the SM Higgs field is localized at y = mR (each end of 
the extra dimension is often called a “brane”, motivated by String Theory) 
and add the following coupling of 5D fermions to it: 


Ssp > J tiedydly - 2B) HWW Asp (1.23) 


where W and W’ are two different 5D fermion fields which are SU(2)r 
doublets and singlets with M, M’ being their 5D masses, respectively. Note 
that UV; and Vp are chosen to be even under Zə so that they give the LH 
and RH zero-modes, respectively. Since Up and Y% vanish at the y = tR 
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mode profile 


1.4k 


T2TA 


Fig. 1.5. Profiles for down (dashed line: 5D mass, M = —2), strange (thin solid line: 
M = —1) and top quarks (thick solid line: M = +1). The SM Higgs is localized on the 
y = TR brane. 


brane, they do not couple to the Higgs as seen in Eq. (1.23). Plugging in 
the zero-mode profiles, we obtain the effective 4D Yukawa coupling, i.e., 
A4D Hy yO : 


Aap © Asp X fro(tR) fro(aR) 
x Aspe M) (1.24) 


Let us consider the hierarchy between the down (d) and strange (s) 
quark masses for example. For simplicity, we set A5p to be the same for 
d,s and also M = —M’ for each quark to obtain (up to small dependence 
of normalization on M’s) 


mad 2AMnrR 
~e 
Ms 


~ 1/100 which is the required, i.e., experimental value (1.25) 


so that AM = Ma— Ms, ~ —2 |for example, Ma = —3, Ms = —1] in units of 
1/(7R) suffices to obtain the hierarchy in 4D masses (or Yukawa couplings). 

The crucial point is that we did not invoke any large hierarchies in the 
5D or fundamental parameters (M or Asp), but we can still obtain large 
hierarchies in the 4D Yukawa couplings. 
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1.3.4. Intermediate Summary: Basic Concepts 


Before moving on, let us summarize: 


(i) A 5D field appears as a tower of KK modes from 4D point of view, 
with each mode having a profile in the extra dimension. 
(ii) The profiles and the KK masses are obtained by solving an eigen- 
value problem (or wave equations in 5D space-time). 
(iii) The coupling of particles (i.e., zero and KK modes) is proportional 
to the overlap of their profiles in the extra dimension. 


1.3.5. Gauge Field on a Circle 


Next, we consider 5D gauge fields with the following 5D action®: 


1 
S5p = J travi Fun" (1.26) 
1 
= i dadys (Fit +F F”) (1.27) 
with 
Am = Au + A5 (1.28) 


As usual, the KK decomposition is achieved by plugging in the ex- 
pansion Au, 5 = n ACK Fp, 5n(y) into Ssp. It is easy to see that this 
procedure is similar to that for a 5D scalar, up to the presence of Lorentz 
index and gauge fixing. It is straightforward to include the Lorentz index 
in the KK decomposition, but there are subtleties with gauge fixing — we 
will not go into details of the latter issue in these lectures (for a discussion 
of this issue, see, for example, 1st reference in [3]). 

The end result is that, on a circle, both A, and As components have 
zero-modes — the former is a vector, whereas the latter is a scalar from the 
AD point of view: see Fig. 1.6(a). 

Thus, we encounter a unification of spins in the sense that massless 4D 
scalars can be obtained from 5D gauge fields. If the 4D scalar Al? remains 
massless, then it will result in an extra long range force which would be 
ruled out by experiments. However, this scalar does acquire a mass from 


“Once the SM fermions propagate in the extra dimension, we can show that the SM 
gauge fields also have to do the same to preserve gauge invariance. 
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MAD 4 (a) S! (b) S!/Z2 


Ap ~ cos (FP) Ay ~ sin (4) 


n>0 n<0 As ~ sin (44) As ~ cos (7) 
E —— — 
11 ————— —_ 


Fig. 1.6. KK decomposition for a 5D gauge field on a circle (a) and on a orbifold (b) 
with choice of even parity for Ap. 


loop corrections (see lecture 5) so that such a light scalar (almost zero- 
mode) might not be a robust problem (unlike the chirality problem with 
fermions on a circle). 


1.3.6. Gauge Field on an Orbifold 


In any case, it is possible to get rid of the As zero-mode using orbifold 
compactification as follows. Notice that for 


Fus = OAs — Os Ap (1.29) 
to have a well-defined Zə parity, we have two choices: 


i) A, is even — it has a zero-mode which is identified with the SM 
H 
gauge boson — which implies that As is odd and so does not have 
a zero-mode [see Fig. 1.6(b)] or 


(ii) A, is odd (no zero-mode gauge boson) so that As is even and has 
a zero-mode. 


As we will see later, the As zero-mode in case (ii) can play the role of 
SM Higgs, but for now, we will make the choice (i), i.e., A, (5) is even (odd) 
so that we do have a zero-mode (i.e., SM) gauge boson. 
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Hence, we obtain the following KK decomposition for this gauge field 
on an orbifold [Fig. 1.6(b)]: 


1 
= — (i.e. a flat profile 1.30 
fu 0 VITR ( P ) ( ) 

1 
7 = cosny/R 1.31 

1 
51 = sin ny/ R 1.32 
We have normalized the modes over —7R < y < +7R, even though the 
physical domain is from y = 0 to y = mR. We can show that Alte) “eats” 
A” to form a massive spin-1 gauge boson from the following mass terms 
Fis 2 OyAsOsA" (1.33) 
~ Aa AS” ayfa nly) (1.34) 


These mass terms mixing A) and Al”) are similar to the ones in the SM: 
W,,0"H(H) (which indicate that the longitudinal polarization of W is the 
unphysical component of Higgs, i.e., the equivalence theorem). 


1.3.7. Couplings of Gauge Modes 


We now calculate the couplings of the various gauge modes to the matter 
particles (in this case fermions) based on their profiles. We can show that 
the coupling of zero-mode is the same to all fermion modes (whether zero 
or KK): 


f eeayar™ (ðm + gsAm) Y > Yyyy x J wR wo 


ð, + AVE ) 1.35 
( # E STR ( ) 
= EPE (By Hga AD) 

(for all n) (1.36) 


with 


ga (or gsm) = AE (1.37) 


The point is that the profile of the gauge zero-mode is flat so that the 
overlap integrals appearing in the kinetic term for fermion mode and in 
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the coupling to gauge zero-mode are identical. This universality of the 
zero-mode gauge coupling is actually guaranteed by 4D gauge invariance. 

However, the couplings of zero-mode fermions to gauge KK modes (com- 
ing from the overlap of profiles) are non-universal, i.e., these couplings de- 
pend on the 5D fermion mass (see Fig. 1.7): 


a(n, M) = gs I dy (Ne~™9)? x fu n(y) (1.38) 
= g4 X a(n, M) (1.39) 


where a is an O(1) quantity (see exercise 1). The reason is that the gauge 
KK profile is not flat (unlike for zero-mode) or equivalently there is no 
analog of 4D gauge invariance for the massive (KK) gauge modes. 


mode profile 


-0.5; FR 


Fig. 1.7. Profiles for down (thick dashed line) and strange (thin solid line) quarks and 
the gauge zero-mode (thick solid line) and 1st KK mode (thin dashed line). The SM 
Higgs is localized on the y = mR brane. 


1.3.8. Flavor Problem from Gauge KK Modes 


Such non-universal couplings of gauge KK modes to fermion zero-modes 
results in flavor violation as follows [8]. The point is that the couplings 
of the gauge KK modes to zero-mode fermions are flavor diagonal, but 
non-universal in the interaction (or weak) basis: 


7 £ ad 0 n d 
ga (dLweak SLweak) ( 0 A gear } & a) (1.40) 
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which results in the appearance of flavor violating couplings after a unitary 
rotation to the mass basis: 


..gaD' diag (aa, as) DL... — g4 (as = aa) (DL)ia X 


dLmass 7" AW” sr mass (1.41) 


where Dz is the unitary transformation to go from the interaction (or weak) 
basis to the mass basis (for left-handed down-type quarks). 

Hence, we obtain a contribution to, for example, K — K mixing ampli- 
tude: 


(as — aa)? (Dr )îo (1.42) 


The SM contribution to K — K mixing amplitude has a suppression mech- 
anism (see below): 


2 


Me 
tau (VusVua)” (1.43) 
WwW 


Msm ~ 


where Vus, ua are the Cabibbo-Kobayashi-Maskawa (CKM) mixing angles. 
Since the data agrees with the SM prediction, we must require the KK 
contribution to be smaller than the SM one and hence we can set a bound 
on the KK mass. Using 


(as — aa) ~ O(1/10) (1.44) 


(see exercise 1), i.e., the fact that the couplings of gauge KK modes to down 
and strange quarks are O(1) different, we get 


Mxx © 20 TeV (1.45) 


assuming that the the Dz mixing angles are of order the CKM mixing 
angles. Such a large KK mass scale could result in a tension with a solution 
to the Planck-weak hierarchy problem: we would like the KK scale to be 
~ TeV for this purpose (we will see later how the KK mass scale is related 
to the EW scale). 

For completeness, we briefly review FCNC’s in the SM below. We begin 
with the transformation of quarks from weak to mass basis. The Yukawa 
couplings of the SM fermions to the Higgs (or the mass terms) are 3 x3 com- 
plex matrices (denoted by Ma in the down quark sector) in the generation 
space. Such matrices can be diagonalized by b¢unitary transformations, 
Dr,r. For simplicity, consider the 2 generation case (this analysis can be 
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easily generalized to the case of 3 generations), where this transformation 
can be explicitly written as 


i i d 
(dLweak STweak) (Ma)2x2 ( oe) 


SR weak 
l 5 diag. / dr mass 
= (dLmass Stmass) Ma lag ( se) 
SR mass 
(1.46) 
where 
a A; 
( L,R aed = DLR & | hap 
SL,R weak SL,R mass 


Mies. = pt MaDr 
Md 0 
= 1.4 
Cay (1.48) 
There are no tree-level FCNC in the SM since the gluon, y and Z vertices 
preserve flavor in spite of the above transformations. Of course, the reason 


is that the couplings of gluon, y and Z in the weak (or interaction) basis 
are universal. Explicitly, 


1 1., -= 7 10 d 
92 (-5 t3 sin? aw ) (di weak SLweak) 247” d ( $ pa 


SL weak 
= 7 10 dr mass 
=... (drmass Simass) Z," Di, l i) Di, te Ta 
Sete 5 di, mass Zu" dz mass (1.49) 
i=d,s 


as compared to Eqs. (1.40) and (1.41). 
However, the charged current (W) couplings are non-diagonal in the 
mass basis: 


J Z 10 d 
V2 (ütweak ČLweak) wir" te o ( 4 weak ) 


SL weak 
2 > 10 d 
= ...(Uimass CLmass) Wiyttt 3 Dz ( a 
SL mass 
= 5 a, mass W” Y, Vorm idi, mass (1.50) 


i=u,c j=d,s 
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where the CKM matrix 
VoKm = Ut} Dr 
Al (1.51) 


since the transformations in the up and down sectors are, in general, not 
related. Hence, the charged currents do convert up-type quark of one gen- 
eration to a down-type quark of a different generation. So, we can use 
the charged current interactions more than once, i.e., in loop diagrams, to 
change one down-type quark to another down-type quark, for example, to 
obtain a AS = 2 process via a box diagram. 

Naively, we can estimate the size of this box diagram 


d'k 1 1 1 
Msm ~ oi | ——Vexm isVOKM js VCKM id VCKM ja r [—_ > 
(27) i 1t Y-mi K-m; k? — M2, 
ma g (Vas Vea) = 
1672 M2, (1.52) 


(neglecting m; j in the up quark propagators: more on this assumption 
below) which turns out to be too large compared to the experimental value! 

However, this is where the Glashow-Iliopoulos-Maiani (or GIM) mech- 
anism comes in. Using the unitarity of the CKM matrix, 


XO V}Va = 0, (1.53) 


we find that M sm vanishes if m; = m,, in particular if we neglect the quark 
masses as we did above. Hence, the amplitude must be proportional to the 
non-degeneracy of the up-type quark masses, i.e., for the two generation 
case we find that 


Den u (1.54) 


which was used earlier in Eq. (1.43). The point is that we get an extra 
suppression of ~ m?2/M?, ~ 1074 compared to the naive estimate in 2nd 
line of Eq. (1.52). 


1.4. Lecture 3 


As we saw in the previous lecture, the extra dimensional model which ad- 
dresses the flavor hierarchy does not have analog of the GIM suppression 
in the gauge KK contribution to flavor violation. The reason is that the 
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couplings of the strange and down quarks to the gauge KK modes, denoted 
by asa (in units of g4), are O(1), and different. 

In order to solve this problem, we would like to modify the gauge KK 
profile, for example, a more favorable picture would be as in Fig. 1.8, where 
gauge KK modes are localized near the y = mR brane whereas light fermions 
are localized near the y = 0 brane as usual. The point is that in this 
case couplings of fermions to the gauge KK modes (even though still non- 
universal) are < 1 (in units of g4) so that the FCNC’s are suppressed. 
So, the question is how to modify KK decomposition in general and, in 
particular, how to obtain the profiles as in Fig. 1.8. 


mode profile 


-0.5} “T7457 - 


Fig. 1.8. Same as Fig. 1.7, but with brane kinetic term, r/R = 10, for gauge fields on 
y = 0 brane. 


1.4.1. Brane Kinetic Terms 


We consider a modification to the extra dimensional model by adding in- 
teractions for the 5D gauge fields which are localized at the fixed points 
(branes). The point is that such interactions are allowed for an orbifold, 
but not on a circle, where there are no such “special” points in the extra 
dimension. In fact, consistency of the model at the quantum level requires 
the presence of such terms since such terms are generated by loops even if 
they are absent at tree-level [9]. 
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Specifically, we study the Lagrangian: 
Lsp = >i [Fun FM + 5(y)r Fun FH] 
+Ù (ðm +gsAm) T” Y (1.55) 
Simple dimensional analysis gives [An] = 3/2, |Y] = 2, [øs] = —1/2 


(here keal denotes mass dimension) so that the brane kinetic term has mass 
dimension —1 (i.e., it has dimension of a length) and is therefore denoted 
by r. 

It is sometimes convenient to use a different normalization for Am: 
Am > Âm /gs in terms of which the action is: 


1 
Lsp = =al FunFMN +ôly i Fw Ft” 
98 


+0 (ou m Âm) rp (1.56) 


With this normalization, we have [Au] = 1 (as in 4D) so that the brane 
kinetic term is dimensionless: we can then define a brane-localized “cou- 
pling” as 1/ Ibrane = r/93. 

We will now study how the KK decomposition is modified in the pres- 
ence of these brane kinetic terms. Consider the case of a scalar field for 
simplicity (the gauge case which we are really interested in is similar). Here, 
we will only give a summary: for details, see exercise 2 and reference [10] 
for example. 

Following the procedure outlined in lecture 2, we find that the orthonor- 
mality condition is modified (relative to the case of no brane terms): 


J 0R a [LE rW] = Sm (1.57) 


and the profiles and mass eigenvalues are given by solving the differential 
equation: 


o? + my + rò(y)m?] fn(y) =0 (1.58) 


The solutions f(y) of this equation are linear combination of sin and cos, 
in particular, a different one for y = 0 to y= TR and y = -TR toy = 0. 

In addition, in order to solve for the coefficients of sin, cos in these 
linear combinations, we must impose conditions such as continuity of fn(y) 
at y = 0, periodicity of f,(y) and matching the discontinuity in derivative 
of fr(y) to 6(y) in Eq. (1.57). 
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1.4.2. Couplings of gauge modes 


It turns out that the zero-mode of the gauge field continues to have a flat 
profile: only its normalization affected by brane term such that 


95 
ee 1.59 
we Vr+2rR ( ) 


For large brane kinetic terms, 


g5 
x 1.60 
g4 Vr ( ) 
Let us now consider couplings of gauge KK modes to particles localized 

on the branes in the limit of large brane terms. We find that 


(i) the coupling of gauge KK mode to a particle (say light SM fermion) 
localized at y = 0 is suppressed (compared to zero-mode): gs x 
fn(0) ~ ga/V/7/R. 

(ii) Whereas, the coupling to particles (such as the Higgs) localized at 
y = TR is enhanced compared to the zero-mode (or SM) gauge 
coupling : gs X fn(@R) ~ g4 x r/R 


The intuitive understanding is that large brane kinetic terms “repel” 
gauge KK mode from that brane (see Fig. 1.8). 


1.4.3. Solution to Flavor Problem 


In reality, the light SM fermions are not exactly localized at the y = 0 brane, 
but we find a similar suppression in their coupling to gauge KK mode for 
the actual profiles of the light fermions which are exponentials peaked at 
y = 0. Hence, based on the rough size of the coupling mentioned in point 
(i) above, we can show that FCNC’s from exchange of gauge KK modes are 
suppressed by a factor of r/R relative to the case of without brane kinetic 
terms, i.e., large brane kinetic terms provide an analog of GIM suppression 
in the SM. 

One might wonder if we are introducing a new hierarchy since we need 
r/R >> 1. However, that’s not really the case since a mild hierarchy of 
O(10) is enough. In fact, we will see in lecture 5 how we can effectively 
obtain the same effect as that of such large brane kinetic terms in a warped 
extra dimension without introducing any brane terms and therefore any 
hierarchy in the 5D theory at all. 
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1.4.4. Electroweak Precision Tests 


Having seen how to suppress contributions of the gauge KK modes to 
FCNC’s, we will now consider their contributions to flavor-preserving ob- 
servables called electroweak precision tests (EWPT). There are 3 such ef- 
fects which we discuss in turn. 


1.4.4.1. 4-fermion operators 


Tree-level exchange of gauge KK modes also generates flavor-preserving 4- 
fermion operators, Fig. 1.9. We can compare these effects to SM (i.e., zero- 
mode) Z exchange which has coefficient ~ g?,/m?, and use the fact that 
the experimental data on these operators agrees with the SM prediction at 
the ~ 0.1% level. For r = 0 (no brane term), we found that gauge KK 
coupling ~ V294 for fermions localized at y = 0 (recall that light fermions 
are localized near y = 0) so that we obtain a limit of mgg Z a few TeV. 
However, for large brane kinetic terms, the gauge KK couplings and hence 
the coefficients of these operators are further suppressed by a factor of 
~ r/R so that mgg ~ TeV is easily allowed by the data. 

The other 2 effects originate from the mixing of zero and KK modes for 
W, Z via the Higgs vev which we now discuss. The gauge group in the bulk 
is SU(2), x U(1)y. We first perform the KK decomposition (i.e., obtain 
zero and KK modes) for W;=1,2,3 and B (hypercharge) setting v = 0. At 
this level, there is no kinetic or mass mixing between these modes. 


Fig. 1.9. 4-fermion operators generated by exchange of zero and KK modes of Z. 
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Next, we turn on the Higgs vev. For v 4 0, we obtain masses for zero- 
modes of B and W; and mass mizing between W3 and B zero-modes (as 
in the SM). We define photon and Z zero-modes, z® and A®, to be 
combinations of wi and B®) such that the zero-mode mass mixing is 
diagonalized (as in the SM). We first define the zero-mode gauge couplings 
(we neglect the brane terms for simplicity here, but it is straightforward 
to include them): gwo) = gs 2/V27R, gz) = 95 z/V27R, where (92 7 = 
93+ 9%"). The weak mixing angle between w and BO), i.e., sin? yw is 
the ratio of these zero-mode gauge couplings. 

It turns out to be convenient to define the KK modes, Z™® and A(”) 
(n #0), using same (0-mode) mixing angles. The reason is that with this 
definition, the KK photon modes Aw do not couple to Higgs (just like 
zero-mode) and hence decouple from the other modes. 

However, the crucial point is that the W= zero mode mixes with the 
KK modes of W= via mass terms coming from the Higgs vev localized at 
y = TR (similarly for Z). Therefore, the mass eigenstates, i.e., SM W= 
and Z, are admiztures of zero and KK modes. To understand this effect, 
we can diagonalize the 2 x 2 mass matrix (for zero and 1st KK mode) for 
simplicity (see exercise 3). 


1.4.4.2. Shift in coupling of SM fermions to Z 


The above zero-KK mode mixing for W, Z induced by Higgs vev results in 
a shift in the coupling of SM W, Z to a fermion localized at y = 0 from 
the pure zero-mode coupling, i.e., SM Z has a (small) KK Z component 
so that gz = gz) + ôgz. We can estimate this effect via mass insertion 
diagrams as in Fig. 1.10 which are valid for v x couplings < mxx to find 
692/9z0) ~ Ize)" /MkK: see exercise 3 for a more accurate calculation. 
Note that there is no enhancement in gz for large brane kinetic terms 
(r/R > 1). The point is that the enhancement in the coupling (relative to 
the zero-mode coupling) at the Higgs-KK Z vertex cancels the suppression 
in the coupling at the fermion-KK Z vertex (cf. the effect on the W, Z 
masses below). Just like the case of 4-fermion operators, the measured 
couplings of SM fermions to Z agree with the SM prediction at the ~ 0.1% 
level so that we obtain a limit of mgg Z a few TeV. 


1.4.4.3. Shift in ratio of W and Z masses or p parameter 


The mixing of zero and KK W modes induced by the Higgs vev also results 
in a shift in SM W mass from the pure zero-mode mass (a similar effect 
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(n) (n) 
Z™ or W ZO or WO 


Fig. 1.10. Shift in the coupling of a SM fermion to SM Z from the zero-mode gauge 
coupling due to the mixing of zero and KK modes of Z. 


also happens for SM Z) as in Fig. 1.11: 


My = Mi, + dMy, where (1.61) 
1 
Myo = {Ivor (1.62) 
4 
v r 
My ~ Iwo —— 5 (1.63) 
w Myg R 


Me 9%} 
p= Œ x2 1.64 
MZ 9% (gS 


The point is that p = 1 in the SM (at the tree-level) and Apeapt. = Peapt. — 
1 ~ 1078. Actually, there is a subtlety in this definition for the 5D model 
due to the fact that the couplings of the Z boson to the SM fermions are also 
modified from the pure zero-mode Z coupling: gz = gzo) +6gz. However, 
as we discussed earlier, dgz, w are not enhanced by r/R > 1 so that we 
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Fig. 1.11. Shift in the masses of SM W, Z from the zero-mode masses due to the mixing 
of zero and KK modes. 


can set gz © gz) in Ap to find 
2 
= 2 2 v a 
5p =p-1~ (970) — wo) Fee R (1.65) 


The crucial point is that Ap is enhanced by the presence of large brane 
kinetic terms such that we must require mK K Ž 10 TeV for r/R ~ 10 (as 
needed to solve the flavor problem). 


1.5. Lecture 4 


In this lecture, we will show how to solve the problem of large corrections 
to the p parameter discussed in lecture 3. For this purpose, we have to 
introduce a “custodial isospin” symmetry in the extra dimension. We will 
then discuss some signals of this extra dimensional scenario. 


1.5.1. Custodial Isospin in SM 


We will first review why p = 1 in the SM at the tree-level. The starting 
point is that the Higgs potential, V(|H]|) in the SM with the complex doublet 
Higgs written as 


H = (hy, ha, hg, ha) (1.66) 


has a global SO(4) symmetry (corresponding to rotations among the 4 
real fields, h;). Moreover, SO(4) is isomorphic to SU(2) x SU(2) — one of 
these SU(2)’s in fact corresponds to the usual gauged SU(2)z group and 
the other one is usually denoted by SU(2)r. The crucial point is that the 
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global symmetry of the Higgs potential is enhanced compared to the gauged 
SU(2)z symmetry. The Higgs vev: 


(H) = (0, 0,0, v) (1.67) 


breaks the global SO(4) symmetry of the Higgs sector (in isolation) to 
SO(3) — the gauged SU(2); symmetry is broken in this process so that the 
W+ gauge bosons acquire masses. The unbroken SO(3) symmetry (which is 
global) is isomorphic to an SU(2) — clearly this unbroken SU (2) is the diag- 
onal subgroup of the 2 original SU(2)’s and is often called custodial isospin. 
It is this remnant symmetry which enforces equal masses for W£ 2,3" 

Of course, W} only mixes with B (there is no mixing for W#). This 


mixing results in the neutral mass, Mz? = 1/4 v? (03 +9'°), not being 
equal to the charged mass, Mj, = 1/4 v2g3. That is the reason why there 
is a factor of g? /g2 in the definition p = Mj, /M2 97/93: this factor takes 
the “violation of custodial symmetry” due to the gauging of hypercharge 
into account. 


1.5.2. Custodial Isospin Violation in 5D 


Based on the above discussion, the sizable Ap in the 5D model signals 
violation of custodial isospin symmetry somewhere in the 5D theory. First 
we begin with identifying the precise origin of custodial isospin violation and 
then we will come up with a solution to this problem. As we saw in lecture 
3, Ap from gauge KK modes « (930) — g% ) ~ go just as in the SM. 
So, the origin of large Ap or custodial isospin violation seems to be similar 
to that in the SM, i.e., it is due to gauging of hypercharge and the resulting 
mixing of Ws with B. However, the point is that there are additional mixing 
effects (compared to the SM) in the 5D model due to the presence of KK 
modes (the mixing of zero-modes amongst each other is same as in the SM). 
In particular, we? — B™ mixing occurs only in neutral sector and has no 
charged counterpart, whereas w = w mixing is symmetric between 
charged and neutral sectors. 

The origin of this dichotomy between charged and neutral sectors is the 
fact that the symmetry gauged in 5D is same as in the SM, i.e., SU(2)z x 
U(1)y, so that we have KK modes only for wẹ = and B: there are no 
charged partners for the B KK modes. This new effect (the custodial 
isospin violation due to B KK modes) is not taken into account by the 


factor of g?/g3 in the definition of p — the point is that this factor only 
0) 


accounts for the mixing only amongst zero-modes, i.e., the wi — Bi 
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mixing. To repeat, we — n) mixing does have a counterpart in the 


charged sector. Moreover, wi —B™ mass term ~ gwo) gf X fn(tR)v? ~ 
Iwo gpov?r/r/R so that this effect is enhanced for large brane terms! 


1.5.3. Custodial Isospin Symmetry in 5D 


It is clear that we need eztra charged KK modes to partner B™® if we 
wish to suppress Ap. We can achieve this goal by promoting the hy- 
percharge gauge boson to be a triplet. Hence, we can restore custodial 
isospin symmetry in the 5D model by enlarging the 5D gauge symme- 
try to SU(2)r, x SU(2)r [11]. It turns out that we need something like 
SU(2), x SU(2)r x U(1)B-z to obtain the correct fermion hypercharges as 
follows. Hypercharge is identified with a subgroup of U(1)r and U(1)p_z: 
Y = T3r + (B — L)/2, with Tər = +1/2 for (u,d)r and (v,e)r and 
B — L = 1/3,-1 for q, l (it is easy to check that this reproduces the 
SM hypercharges). Note that we still have extra neutral KK modes from 
U(1)s-z (which have no charged counterpart), but these KK modes do 
not couple to Higgs since the Higgs has B — L charge of zero: only KK 
We R couple to Higgs such that the KK exchanges which give the shifts in 
masses respect custodial isospin (i.e., they are the same in the charged and 
the neutral channels). 

Of course, we must break SU(2)r x U(1)p_1 down to U(1)y, i.e., we 
must require that there are no zero-modes for WẸ and the extra U(1) which 
is the combination of U(1)r and U(1)g-z orthogonal to U(1)y. However, 
this breaking must (approximately) preserve degeneracy for (at least the 
lighter) Wẹ and WẸ modes such that Ap continues to be (at least approx- 
imately) protected. It is clear that for this purpose we require degeneracy 
in both the mass of these modes and their coupling to the Higgs. This 
might seem to be challenging at first, but note that, for large brane kinetic 
terms (r/R > 1), KK modes are localized near y = mR. Therefore, if we 
break custodial isospin on the y = 0 brane, then the degeneracy between 
W3 and Wẹ is not significantly affected by this breaking. Specifically, we 
write down a large mass term for Wp and the extra U(1) at y = 0 which 


can originate from a localized scalar vev (different from the SM Higgs). We 
can show that this is equivalent to requiring vanishing of these gauge fields 
at y = 0 (odd or Dirichlet boundary condition: section 3.3 of reference [2]). 
This illustrates the general idea that breaking a 5D gauge symmetry by a 
large mass term localized on a brane is equivalent to breaking by boundary 
condition. 
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1.5.4. Signals 


Let us consider some of the signals of this extra-dimensional set-up. A 
quick glance at Fig. 1.8 tells us that the coupling of gauge KK modes to 
top quark is enhanced compared to the SM couplings, whereas the couplings 
to the light SM fermions are suppressed (all based on the profiles for these 
modes). 

We begin with real production of gauge KK modes, for example, the 
KK gluon. Due to the ~ TeV mass for these particles, it is clear that 
we have to consider such a process at the Large Hadron Collider (LHC). 
Based on the above couplings, we typically find a broad resonance decaying 
into top pairs making it a challenge to distinguish the signal from SM 
background. It turns out that due to a constraint from a shift in the 
Z — bb coupling,’ we cannot localize bz, and hence its partner tz, too close 
to the Higgs brane, forcing us to localize tz near the Higgs brane in order 
to obtain the large top mass. Hence the KK gluon dominantly decays to 
RH top quark. We can use this fact (and noting that the SM tt production 
is approximately same for LH and RH top quarks) for the purpose of signal 
versus background discrimination [12]. It is easy to distinguish this signal 
for the extra dimension from SUSY: there is no missing energy (at least in 
this process) and top quark is treated as “special” in the sense that it has a 
larger coupling (than the other SM fermions) to the new particles, namely 
KK modes, unlike in SUSY. 

We can also consider virtual exchange of gauge KK modes. 


(i) In analogy with the shift in the coupling of SM fermion to the Z 
that we considered earlier, we see that ttZ is shifted compared to 
the SM prediction (or compared to tiuZ and GcZ) since top quark 
(up quark) is localized near y = mR (y = 0) brane. Such an 
effect can be easily measured at the International Linear Collider 
(ILC) [13]. 

(ii) From the above discussion, it is clear that the couplings of the top 
and charm quarks to the KK Z are diagonal, but not universal in 
the weak or interaction basis. Once we rotate to the mass basis, 
there is a flavor violating coupling to KK Z to the top and the 
charm quark. In turn, this effect induces a flavor violating coupling 


fThis shift in the coupling originates from diagrams similar to the ones we considered 
earlier for the shift in coupling of SM fermion to the Z: see Fig. 1.10. Such shifts 
are enhanced if SM fermion is localized near y = mR brane, where gauge KK mode 
is peaked. 
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of the SM Z to the top and charm quarks (via mixing of KK and 
zero-mode Z), resulting in a flavor violating decay of the top quark: 
t — cZ. Such decays can be probed at the LHC [14]. 


1.5.5. Summary of Model and Unanswered Questions 


So, far we have considered a model with the SM gauge and fermion propa- 
gating in the bulk of a flat extra dimension, with the Higgs localized on or 
near one of the branes. The other SM particles (gauge bosons and fermions) 
are identified with zero-modes of the corresponding 5D fields. 

We have seen that a solution to the flavor hierarchy of the SM is possible 
using profiles for the SM fermions (again, these are the zero-modes of the 
5D fields) in the extra dimension; in particular, top and bottom quarks 
can be localized near the Higgs brane, whereas the 1st and 2nd generation 
(or light) fermions can be localized near the other brane. Moreover, the 
resulting flavor problem due to non-universal couplings of gauge KK modes 
to the SM fermions (for a few TeV KK scale) can be ameliorated with large 
brane kinetic terms for 5D gauge fields on non-Higgs brane (i.e., where the 
light fermions are localized). 

We also studied constraints from electroweak precision tests on this set- 
up and found that these constraints can also be satisfied for mgg ~ TeV, 
provided there is a custodial isospin symmetry in the bulk to protect the 
observable related to the ratio of W/Z masses (the p parameter). 

This set-up still leaves some questions unanswered: 


(i) We have assumed so far that mgg ~ TeV, but why is it < Mp,.? 

(ii) Is there a mild hierarchy problem associated with having large 
brane kinetic terms? Moreover, it seems a bit arbitrary that such 
terms appear only at y = 0 brane (where light SM fermions are 
localized) and not at y = TR. 


We will see in the next lecture that both these questions can be answered 
by using a warped geometry (instead of flat extra dimension). 
Furthermore, 


(iii) Why does Higgs have a negative (mass)? or why does electroweak 
symmetry breaking (EWSB) occur? What sets this mass scale? 
Specifically, can the hierarchy my « Mp; be due to some dynamics 
giving my ~ mgg which, in turn, is ~ TeV? 

(iv) Why is the Higgs localized on or near one of the branes? 
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These questions will be answered by a combination of Higgs being As, 
i.e., the 5th component of bulk gauge field and warped geometry. 


1.6. Lecture 5 


In this lecture, we will be brief: for details and a more complete set of 
references, see the excellent set of lectures by Sundrum [3]. 


1.6.1. Warped Extra Dimension (RS1) 


We begin with a review of the original Randall-Sundrum model (RS1) [15]: 
see Fig. 1.12. It consists of an extra-dimensional interval (y = 0 to 7R as 
before), but with the gravitational action containing a bulk cosmological 
constant (CC) and brane tensions (localized or 4D CC’s): 


Ssp = I dtzdyv—det G (MRs — A) 
Sbrane ba == pee V —det 91, 211, 2 (1.68) 


with guv 1(£) = Guu (£, y = 0) and guv 2(£) = Guu (£, yY = TR), where guv’sS 
are the induced metrics on the branes and Gmn is the bulk metric. Also, 
Ms is the 5D Planck scale and Rs is the 5D Ricci scalar. 

With the following two fine-tunings: 


Ti = -T = 24kMè, 1.69 
5 


where the (curvature) scale, k is defined using A = 24k? M3, we obtain a 
flat (or Minkowski), but y-dependent 4D metric as a solution of the 5D 
Einstein’s equations: 


(ds)? = e 7kYn (da) (da)” + (dy)? (1.70) 


Thus, the geometry is that of a slice of anti-de Sitter space in 5D (AdSs). 
The y-dependent coefficient of the 4D metric, i.e., e~*Y is called the “warp 
factor”. 

4D gravity: The 4D graviton (which is the zero-mode of the 5D gravi- 
tational) corresponds to fluctuations around the flat spacetime background, 
i.e., go) (£) S Quy + h£) (x). As usual, we plug this fluctuation into the 5D 
action and integrate over the extra dimensional coordinate to find an effec- 
tive 4D action for g® (x): 


M3 
Sap = ae (1- caer) | dtoy—det gORalg] (1.71) 
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Lp Tù 
4 4 
Planck/UV brane TeV/IR brane 
Th ey —A (bulk CC) —~ fF 
(brane tension) (brane tension) 


ds? = e huh” dr dry + dy? 


— Higgs 
y=0 y w y=TR 
M, eff. ~ Ms My eff. ~ Me™ M, eff. ~ Mge~**F 
~ Mpi ~ TeV 
Fig. 1.12. The Randall-Sundrum (RS1) model. 
from which we can deduce the 4D Planck scale: 
M3 
M2, = 5 (1 i ee) 
k 
M3 
x Gis forkR>1 (1.72) 


We choose k Š Ms so that the higher curvature terms in the 5D action 
are small and hence can be neglected. Thus, we get the following order of 
magnitudes for the various mass scales: 


k Š Ms Š Mp; ~ 1088 GeV (1.73) 


It turns out that the 4D graviton is (automatically) localized near y = 
0 (which is hence called the Planck or UV brane) - that is why the 4D 
Planck scale is finite even if we go to the decompactified limit of R — oo 
in Eq. (1.72). Specifically, its profile is ~ e7?*Y, 


1.6.2. Solution to Planck-Weak Hierarchy 


The motivation for the RS1 model is to solve the Planck-weak hierarchy 
problem. Let us now see how this model achieves it. Assume that a 4D 
Higgs field is localized on the y = mR brane which is hence called the TeV 
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or IR brane: 
SHiggs = IRET 92 sina. Odo 
a (UH 8) a 


where the natural size for vp is the 5D gravity or fundamental scale (Ms). 
Using the metric induced on the TeV brane, guv2 = Gury wR) = 


go) e kr the action for the Higgs field becomes 


e~4krR) (H|? — v2)” | (1.75) 


Now comes the crucial point: we must rescale the Higgs field to go to 


canonical normalization, H = He’™®, which results in 


N (iå? - ye G ] (1.76) 


Note that the Higgs mass is “warped-down” to ~ TeV from the 5D (or 
the 4D) Planck scale if we have the following modest hierarchy between 
the radius (or the proper distance) of the extra dimension and the AdS 
curvature scale. 


krR ~ log (Mpı/TeV) 
~ 30 or 
Re (1.77) 


Moreover, the quartic coupling is unchanged and hence the Higgs vev (or 
weak scale) is also at the TeV scale, assuming A ~ O(1). 

Note that the radius of the extra dimension is not a fundamental or 5D 
parameter, rather it is determined by the dynamics of the theory. Hence, 
in order to have complete solution to the hierarchy problem (without any 
hidden fine-tuning), we must show that the radius can be stabilized at 
the required size without further (large) fine-tuning of parameters of the 
5D theory. In fact, stabilization of such a radius can be achieved using a 
bulk scalar (Goldberger-Wise mechanism) [16], provided we invoke a mild 
hierarchy M?/k? ~ O(1/10), where M is the 5D mass of the scalar. 
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Thus, we see that the Planck-weak hierarchy can be obtained from O(10) 
hierarchy in the fundamental or 5D theory! In general, a large (“expo- 
nential”) hierarchy for the 4D mass scales can be obtained from a small 
hierarchy in the 5D parameters. 

The central feature of a warped extra dimension is that the effective 4D 
mass scale depends on position in the extra dimension. In order to have a 
more intuitive understanding of this feature, consider the position y ~ yo 
where the metric is: 

(ds) iyo Ye uv (de) (dr) + (dy)? (1.78) 
In terms of the rescaled coordinate and mass scale: @ = e~*#2, ñap = 
emp, we get 


(d8)jvyo ~ Nav (d&)"(dé)” + (dy)? (1.79) 


The advantage of the new coordinates ĉ is that we have a “flat” metric 
in terms of it so that we expect tap ~ Msp (such a relationship is valid 
in the absence of warping). Converting back to original mass scales, we 
find map ~ e~*¥msp, i.e., 4D mass scales are warped compared to 5D 
mass scales. An analogy with the expanding Universe is useful: just as 3D 
space expands with time, in the warped extra dimension, the 4D space-time 
“expands” (or contracts) with motion along the 5‘” dimension. 


1.6.3. Summary of RS1 


The preceding discussion leads us to the “master equation” for a warped 
extra dimension: 


My, off. (Y) ~ Ms xe 


relating the effective 4D mass scales on the left-hand side (LHS) of the 
above equation to the fundamental or 5D mass scale on the right-hand 
side (RHS) by the warp factor. Applying it to the case of the 4D graviton 
localized at y ~ 0, we get 


Mp, ~ Ms (1.80) 
so that we must choose the 5D Planck scale to be 
Ms ~ 10'8GeV (1.81) 
Whereas, the Higgs sector is localized at y ~ 7R so that 


M weak ~ Ms wert" (1.82) 
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so that 


Myeak ~ TeV (1.83) 


provided we have a mild hierarchy 


krR ~ log (Mpı/TeV) 
~ 30 (1.84) 


1.6.4. Similarity with Flat TeV-Size Extra Dimension with 
Large Brane Terms 


In the original RS1, it was assumed that the entire SM, i.e., including 
fermion and gauge fields, is localized on the TeV brane. However, it was 
subsequently realized that, in oder to solve the Planck-weak hierarchy prob- 
lem, only the SM Higgs boson has to be localized on or near the TeV brane 
— the masses of non-Higgs fields, i.e., fermions and gauge bosons, are pro- 
tected by gauge and chiral symmetries, respectively. 

So, we are led to consider RS1 with the SM gauge [17] and fermion 
fields [18] propagating in the bulk (with the Higgs still being on or near 
the TeV brane). It turns out that the profiles for the SM fermions in 
the bulk can address the flavor hierarchy just as in the case of flat extra 
dimension. Moreover, solving the wave equation in curved spacetime, we 
can show [17-19] that all KK modes are localized near the IR brane (that 
too automatically, i.e., without actual brane terms) and the KK masses are 
given by mgg ~ ke~*"™® and not 1/R [note that, based on Eqs. (1.73) and 
(1.77) 1/R is of the size of the 4D Planck scale!]. Hence, we find mgg ~ 
TeV given the choice of parameters to solve the Planck-weak hierarchy 
problem! A very rough intuition for localization of KK modes near the 
TeV brane is that the KK modes can minimize their mass by “living” 
near IR brane, where all mass scales are warped down. In this sense, the 
warped extra dimension “mimics” large brane kinetic terms of flat geometry 
— recall that the large brane kinetic terms in a flat extra dimension result 
in a similar localization of KK modes. In addition, the hierarchy mgg < 
Mp, is explained by the warped geometry. This addresses the 1st and 2nd 
questions outlined at the end of the previous lecture. 

Because of this localization of KK modes near the TeV brane, we find 
that the solution to the flavor problem and the discussion of the electroweak 
precision tests (including custodial isospin) goes through (roughly) as in the 
case of a flat extra dimension. 
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1.6.5. Unification of Spins: Higgs as As 


We now return to the other (3rd and 4th) questions asked at the end of the 
previous lecture, namely, what sets the scale of EWSB or Higgs mass and 
why is Higgs localized on the TeV brane? 

We will show in this and the next subsection that obtaining the SM 
Higgs as the 5th component of 5D gauge field (or As) can resolve the 3rd 
question and then outline in the final subsection how combining the idea of 
Higgs as As with the warped geometry answers the 4th question, resulting 
in a “complete” model. 

As a warm-up for the idea of Higgs as As (see the review [20] for ref- 
erences), consider an SU(2) gauge theory in an extra dimension which is 
compactified on a circle ($1). As we saw earlier, for n Æ 0, the AÇ? modes 
“eat” Ao”) modes to form massive spin-1 states. Moreover, there is a (mass- 
less) zero-mode As, which is in adjoint representation of SU(2), i.e., it is 
charged under the SU(2) gauge symmetry. We can introduce a SU(2) dou- 
blet fermion in the bulk which will acquire a Yukawa coupling ~ g to the 
As zero-mode from the interaction Ù z AsV p coming from the 5D covariant 
derivative. Hence, this scenario is often called “Gauge- Yukawa unification”. 

Note that this scalar has no potential at the tree-level since it is part 
of a 5D gauge field. We will now discuss the potential for A5 zero-mode 
induced by loop effects to find that it is finite. Naively, the scalar (mass)? 
gets quadratically divergent loop corrections: m?o ~ g3/ (167?) Ay. 
However, from the 5D point of view, it is clear that 5D gauge invariance 
protects the As scalar mass from receiving divergent loop corrections (there 
is no counter-term to absorb such divergences and so these must be absent). 
The reader is referred to the 1st reference in [3] for a detailed calculation of 
mio coming from a fermion loop for the simpler case of a U(1) gauge field 


in the bulk. The summary is that loop contributions to m?o are “cut-off” 
5 
by RoI: 


2 J4 p-2 
~ ~ R 1. 
mo = (1.85) 


Intuitively, the understanding is that As behaves as a “regular” scalar till 
E ~ Rt: see Fig. 1.13(a). Beyond these energies, the quantum correc- 
tions “realize” that As is part of a 5D gauge field. Therefore, the loop 
contributions from E © R7! are highly suppressed, in particular, there is 
no divergence. Thinking in terms of KK modes, there is a cancellation 
in the loop diagram among the different modes. We can then ask: what 
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(a) (b) 
4 E 4 
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4 
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a dł = 5D EFT (SM + Kk’s) 
$ a —— cancellation in mo 
between KK’s 5 
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As” is “regular” scalar 4D EFT: 0 or lightest modes 
0 MLE yi Smee lee cee aa ee en chet ae gate 2S ak tal 0 Pais heal Sash lh Saag fest 


Fig. 1.13. Contributions to mass of As (a) and various energy scales in the 5D model 


(b). 


3 


did we gain relative to a “regular” scalar (which is not an As zero-mode, 
but is localized on a brane or originates in a 5D scalar field)? To answer 
this question, we need to know what is Ayy, the scale which cuts of the 
divergence in the case of a regular scalar. The 4D SM (without gravity) 
is renormalizable so that the cut-off is the Planck scale (where quantum 
gravity becomes important). However, the 5D gauge theory, even without 
gravity, is non-renormalizable and therefore must be defined with a cut-off 
(which is not related to the Planck scale): see Fig. 1.13(b). The reason is 
that the 5D gauge coupling constant is dimensionful so that the 5D loop 
expansion grows with energy: g?E/(16x?). Since we cannot extrapolate 
the 5D gauge theory beyond the energy scale where the loop expansion 
parameter becomes ~ 1, we must introduce a cut-off at this scale: 


(1.86) 


where we have set the brane terms to be small so that g4 ~ gs/sqrtR. Note 

that this cut-off is not much larger than the compactification scale since 

ga ~ 1 in the SM. Thus, we find that m? ©) is suppressed relative to the 
5 
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mass” in the case of a regular scalar by ~ (Ayy R)? ~ (167/92)”: we do 
gain by going to As. 

Next, we discuss how to use As for radiative symmetry breaking (often 
called Hosotani mechanism) [21]. Continuing with the case of SU(2) on St, 
we see that a vev for the As zero-mode, (A) can break SU(2) gauge sym- 
metry to a U(1) gauge symmetry. The point is that fermion loops typically 
give mo < 0, whereas gauge loops are of opposite sign. However, the 


fermion contributions can win if the number of fermion degrees of freedom 
is larger than that of gauge bosons. 

Thus, we have a “cartoon” of the SM in the following sense. We can 
identify the SU(2) gauge group that we considered above with the SM 
W’s. We will then get Mw+ ~ R! (coming from (As)), whereas W3 
[corresponding to the unbroken U(1) gauge symmetry] remains massless (it 
is the “photon”). Finally, the Yz AsV, coupling mentioned above gives a 
fermion mass Mpo) ~ R! ~ Mw which is roughly correct for top quark 
(since m ~ My). 

Of course, this model is far from being realistic: 


(i). We must require 1/R >> 100 GeV since we have not seen any KK 
modes in experiments so far which have probed energy scales up to 
~ TeV (either directly in the highest energy colliders or indirectly 
via virtual effects of new particles). To satisfy this constraint, we 
can fine-tune the fermion versus the gauge loop contributions to 
As mass such that My+ or (As) ~ 100 GeV « R7!. 

(ii). More importantly, we do not have fermion chirality on a circle. 


1.6.6. Towards Realistic Higgs as As: Chirality and Enlarg- 
ing the Gauge Group 


As we saw earlier, we can obtain chiral fermions by going to an orbifold: 
S'/Z. However, if we require A, of SU(2) to be even under Z2 (such 
that we get a corresponding zero-mode, i.e., a massless 4D gauge boson), 
then the As’s are necessarily odd. Thus, we lose the scalar zero-mode. In 
any case, the scalar was in the adjoint representation of SU(2), whereas we 
need a doublet for EW symmetry breaking. 

The trick is to enlarge the gauge group to SU(3) and to break it down 
to SU(2) x U(1) by boundary condition as follows. Choose the following 
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parities under Z2 for the fundamental representation 


3] —- P13], where 


P= + (1.87) 


Given this parity choice, can derive the transformation of any other repre- 
sentation under Z2. For example, consider fields in the adjoint represen- 
tation, ®, (a = 1...8), written as a 3 x 3 matrix, 6,T%, where T°’s are 
generators of the fundamental representation. This matrix transforms as 


8s |= P |8| P| ++- (1.88) 
--+ 


This implies that if the A,,’s belonging to SU(2) x U(1) are chosen to 
be even (and hence have a zero-mode), then the A,,’s of the coset group 
SU(3)/ [su(2) x u(1)] are odd (i.e., do not have a zero-mode). This choice 
of parities thus achieves the desired breaking pattern SU (3) — SU (2) x 
U(1). Moreover, the A5’s of su(3)/[su(2) x U(1)] are even, giving us a 
scalar zero-mode which is a doublet of the unbroken SU (2) group as desired. 

Furthermore, just like in the case of the breaking SU(2) — U(1) dis- 
cussed earlier, the breaking of SU(2) x U(1) can be achieved by vev of As 
which is generated by loop corrections. Moreover, due to usage of funda- 
mental representation for this radiative symmetry breaking, the rank of the 
gauge group is also broken, i.e., we have an unbroken U(1) symmetry. 

A 5D fermion which is a triplet of SU (3) gives zero-modes for LH SU(2) 
doublet and RH singlet: 


iyo & a (1.89) 


where D and S denote SU (2) doublet and singlet, respectively — recall 
that the parities of the RH and LH fields must be opposite. Moreover, the 
Yukawa coupling for the zero-mode fermions comes from the interaction 
YP AsV. Thus, we are getting closer to the SM! 
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1.6.7. Realistic Higgs as As in Warped Extra Dimension 


When we construct the previous model in a warped extra dimension, it 
turns out that the A is automatically localized near the TeV brane [22] 
— recall that in order to solve the hierarchy problem, we would like the 
Higgs to be localized precisely there. Thus, As zero-mode is an excellent 
candidate for SM Higgs! 

As “finishing touches”, we can add an extra U(1) to obtain the correct 
hypercharges for the fermions and similarly a custodial isospin symmetry 
to protect the p parameter [23]. Also, it turns out that the WP and YẸ 
have (effectively) “opposite” sign of 5D mass, M (recall that this mass is 
not coming from (AM) in the sense that if the LH zero-mode is localized 
near y = 0, then the RH zero-mode must be near y = TR (or vice versa): 
see exercise 1. To relax this constraint, i.e., to obtain more freedom in 
localization of LH versus RH zero-modes, we can instead obtain LH and RH 
SM fermions as zero-modes of different bulk multiplets. However, then the 
question arises: since As only couples fermions within the same fermionic 
multiplet, how do we obtain Yukawa couplings? The solution is to mix 
fermionic multiplets by adding mass terms localized at the endpoints of the 
extra dimension. 


1.6.8. Epilogue 


Due to lack of time, we have not considered other extra dimensional mod- 
els with connections to the weak scale (and gravitational aspects of extra 
dimensional models in general). Here, we give a summary of the essential 
features of these other models: for details, see the references below and 
other lectures [1-4]. Arkani-Hamed, Dimopoulos and Dvali (ADD) pro- 
posed a scenario where only gravity propagates in extra dimensions, with 
all the SM fields localized on a brane [24]. The idea is that the fundamen- 
tal or higher-dimensional gravity scale is ~ TeV (and not the 4D Planck 
scale), while the weakness of gravity (or largeness of 4D or observed Planck 
scale) is accounted for by diluting the strength of gravity using extra di- 
mensions which are much larger in size than the fundamental length scale, 
ie., R>>1/ TeV. The crucial point is that the gravitational force law has 
been tested only for distances larger than O (100) um so that such very 
large extra dimensions could be consistent with current experiments. Only 
the graviton has KK modes in this framework, that too very light, result- 
ing in interesting phenomenology both from real and virtual production of 
these KK modes. These KK modes couple with the usual 4D gravitational 
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strength, but their large multiplicity can compensate for this very weak 
coupling. 

At the other extreme is the model called Universal Extra Dimensions 
(UED) [25]. This scenario has a flat extra dimension(s) in which all the SM 
fields (including Higgs) propagate. The 5D fields have no brane localized 
interactions at the tree-level: of course, loops will generate small brane 
terms. Moreover, there are no 5D masses for fermions and Higgs so that 
profiles for all zero-modes (including all fermions, gauge fields and Higgs) 
are flat. Hence, we do not have a solution to the flavor hierarchy of the 
SM unlike in the scenario considered in these lectures. The motivation for 
UED is more phenomenological: there is a remnant of extra dimensional 
momentum or KK number conservation (dubbed KK parity) which forbids 
a coupling of a single lightest (level-1 and in general, odd level) KK mode 
to SM particles. Such a coupling is allowed for level-2 (and in general, 
even level) KK modes, but it is still suppressed by the small (loop-induced 
size) of brane kinetic terms.8 Hence, the contributions from KK exchange 
to precision tests are suppressed (in particular, tree-level exchange of odd 
level modes is forbidden), easily allowing KK mass scale below a TeV for 
level-1 and even level-2 modes (cf. the lower limit of a few TeV in the 
scenario studied in these lectures). Thus, KK modes can be more easily 
produced at colliders (even though it is clear that the odd level KK modes 
have to be pair produced). Moreover, the lightest KK particle (LKP) is 
stable and can be a good dark matter candidate [26]. 

Finally, we mention the 5D Higgsless models [27], where EW symmetry 
itself is broken by boundary conditions like the breaking of 5D custodial 
isospin symmetry mentioned in lecture 4 [or the breaking SU (3) — SU (2) x 
U(1) considered in lecture 5 in order to obtain Higgs as As]. The idea is 
that there is no light Higgs in the spectrum in order to unitarize WW 
scattering, which is instead accomplished by exchange of gauge KK modes. 
These KK modes then must have mass ~ 1 TeV. It turns out that due 
to such a low KK scale, the simplest such models are severely constrained 
by precision tests, but it is possible to avoid some of these constraints by 
suitable model-building. 


SThis coupling does not preserve KK number conservation or extra dimensional trans- 
lation invariance and hence must arise from interactions localized on the branes which 
violate these symmetries. 
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Appendix A. Excercises 
A.1. Exercise 1 


A.1.1. Zero-Mode Fermion and 4D Yukawa Coupling 


Show that the normalized profile for LH zero-mode fermion (i.e., choosing 
Yz to be even) is (lecture 2): 


/ M 
froly) = RT (0<y<7R) 
| M x 
= \ RIT My (0>y>-rR) (A.1) 


where the normalization is over 0 < y < 27R (even though the physical 
domain is from y = 0 to y = tR). Similarly, if we choose Vp to be even 
instead of Yz, then the RH zero-mode profile is 


-M = 
froly) = Y amn E 
[_—M 
> e-2MnrR _ ger (0 2y2 =r R) (A.2) 


Note the opposite sign of M in the LH versus RH zero-mode profiles [fol- 
lowing from Eqs. (1.20) and 1.21)]. Assuming that the SM Higgs field is 
localized at y = 7R, we see that we need M < 0 (> 0) for LH (RH) fermion 
to obtain small fermion wavefunction at the location of the Higgs and hence 


small 4D Yukawa couplings for light fermions (1st and 2nd generations). 
EMTR 


0<y<7R) 


So, we can neglect e7 compared to 1 wherever appropriate. 

The zero-mode (4D or SM) Yukawa coupling in terms of the 5D Yukawa 
coupling: f dyd*z6(y)\Asp HWW’, [where Yz is SU(2)z doublet and WV’, is 
SU(2)z singlet] is: 

A4D ye AspMe2M@rR (A.3) 
and the 4D mass of fermion is 


my AaDv, (A.4) 
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where, for simplicity, we assume equal size of 5D masses, i.e., M = —M’, 
for doublet and singlet fermions. 


A.1.2. Coupling of Zero-mode Fermion to Gauge KK mode: 
No Brane Kinetic Terms 


The profile for nth gauge KK mode (mpn = n/R) is: 


fnly) = cos (Mny) (A.5) 


1 
VTR 
Calculate the coupling of zero-mode fermion to gauge KK modes in terms 
of the coupling of zero-mode gauge field (i.e., SM gauge coupling), g4 = 


gs/V2TR: 
g (n, M) = gaa (n, M) (A.6) 


You should obtain: 


4M? 


a(n, M) & aint 


(A.7) 

Use ma,s = 1 MeV, 100 MeV and the Higgs vev v ~ 100 GeV. Assume, 
for simplicity, that Asp M = 1 for both s,d — otherwise, we have to solve 
a transcendental equation to obtain M (given the 4D Yukawa coupling). 


Calculate the 5D masses Ms a and show that a(1, Ms) — a(1, Ma) + 0.1. 
Compare K — K mixing from KK Z exchange as in lecture 2 


I Boe y 2 
5 [a (1,M;)—a(1, Ma) | (mixing angle) (A.8) 


MKK 


to the SM amplitude 


A? t2 
Te i (mixing angle)? (A.9) 


to obtain bound on mgg of ~ 20 TeV, using gz + 0.75 and g2 ~ 0.65 for 
the SM Z and SU(2),, gauge couplings. 

It turns out that another observable called eg (which is the imaginary 
or CP-violating part of the above K — K mixing amplitude) gives a stronger 
bound on KK mass scale of ~ 100 TeV. 
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A.2. Exercise 2 


A.2.1. General Brane Kinetic Terms 


The Lagrangian is 


1 1 
Lsp 3 —7FuËÊ FM" — FOU) Fuu F” (A.10) 


where r has dimension of length. 


Go through the derivation outlined in lecture 3, i.e., fn satisfies the 


orthonormality condition: 


J Rao [LA r5(W)] = Sm (A.11) 


and the differential equation: 


[2 + m? (1 + r6(y)) | fay) = 0 (A.12) 


The solution is 


fn(y) = an cos (mny) 4 
= dn, cos (Mny) J 


+ bn sin (Mny) for y > 0 
+ bn sin (mny) for y < 0 (A.13) 


Use the following 4 conditions to obtain relations between coefficients a, b’s 
and to solve for mp: (i) continuity at y = 0, (ii) discontinuity in derivative 
matches brane term, (iii) fn is even and (iv) periodicity of fn. In particular, 
condition (iv) is satisfied by repeating (or copying) fn between —7R and 
TR to between 7R and 37R and so on. However, continuity of fn at y= tR 


has to be imposed and similarly that 
kinetic term at y = TR). 
You should find 


of derivative of f» (assuming no brane 


An = Gn 

bn o rma 

an 2 

bn = —bn 

bn 

A tan (Mn TR) (A.14) 


so that eigenvalues are solutions to 


tan(m,7R) = — 


TIM 


; (A.15) 
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Finally, calculate 


1 
ae (1 + irm? + — ) (A.16) 
from normalization. 


A.2.2. Large Brane Kinetic Terms 
Verify approximate results shown in lecture 3 for large brane kinetic terms, 
r/R > 1, namely, 


(i) mn © (n + 1/2) /R, 


(ii) 1/94 = r/g5 
and for lightest KK modes (small n) 


(iii) coupling of a fermion localized at y = 0 to gauge KK mode ~ g4/\/r/R 


(iv) coupling of gauge KK mode to a fermion/Higgs field localized on y = 


TR brane ~ gay r/R. 


We can generalize these couplings of gauge KK mode to the case of a 
zero-mode fermion with a profile in the bulk — it’s just that we have to 
do an overlap integral as in problem 2 of exercise 1. Calculate the new 
a(l, Ms) —a(1, Ma). For r/R > 1, show that it is smaller than before (i.e., 
without brane terms) so that K — K mixing is suppressed and a lower KK 
mass scale is allowed. 


A.3. Exercise 3 


As discussed in lecture 3, the zero and KK modes of Z are defined by setting 
the Higgs vev to zero. However, due to non-zero Higgs vev, the zero and 
KK modes of Z mix via mass terms — kinetic terms are still diagonal. The 
Z-Z (i.e., 1% KK mode of Z) mass matrix is: 


2 2 u (0) 
(0) (1) m Am Z 
VETE (zi zę ) 6 2 4? ) a (A.17) 
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where m? = 1/4 g7.)v*, mixing term Am? = 1/4 gzogs z fı (TR) v? and 


M? = mg +1/4 g2 zf? (aR) v?. Here, fı (mR) is wavefunction of Z) 
evaluated at the Higgs brane (y = mR). Also, gzo) = gs z/V20R+r 
denotes the coupling of 7) (where r is the brane kinetic term at y = 0) 
and gs z = \/9%. +957 denotes the 5D coupling of Z, with gs2 and g$ 
being the 5D gauge couplings of SU(2) and U(1)y, respectively (assume, 
for simplicity, the same brane kinetic term r for all gauge fields). 

Diagonalize this mass matrix, assuming v? /mĉ, g x gauge couplings < 1 
where appropriate, i.e., determine 


(i) the unitary transformation to go from (ZZ) to physical basis and 


(ii) the eigenvalues of the mass matrix. 


There are 2 effects of this diagonalization. 


A.3.1. Shift in Coupling of a Fermion to Z 


Given couplings of a fermion to Z© and Z“ (KK basis) 


M z® 
Lecoupling 2 py” (g, G) ZO) p (A.18) 
b 


use the above unitary transformation to calculate the couplings to the 
fermion in the physical basis, denoted by Zyigne (which is SM Z) and Zheavy- 

Specifically, calculate the coupling of a fermion localized at y = 0 to 
the SM Z using g = gz) and G = gs z fı (0) in the above equation, where 
fi(0) is wavefunction of Z evaluated at the fermion brane (y = 0). 

Verify that the shift in the coupling of this fermion to the SM Z from 
the zero-mode Z coupling (i-e., gz) ) is as shown in lecture 3: 6gz/gzi0) ~ 
Iz) %'/MK, in particular, that there is no enhancement for large brane 
kinetic terms, r/R > 1. 


A.3.2. Shift in Z mass 


The lighter eigenvalue of mass matrix is the SM Z mass. Verify that the 
shift in the SM Z mass from the purely zero-mode mass, i.e., 1/4g3()v”, 
is as shown in lecture 3, in particular, that there is an enhancement in this 
shift due to r/R >> 1 (when the shift is expressed in terms of gz.0)). 
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The International Linear Collider (ILC) is the next large scale project 
in accelerator particle physics. Colliding electrons with positrons at en- 
ergies from 0.3 TeV up to about 1 TeV, the ILC is expected to provide 
the accuracy needed to complement the LHC data and extend the sensi- 
tivity to new phenomena at the high energy frontier and answer some of 
the fundamental questions in particle physics and in its relation to Cos- 
mology. This paper reviews some highlights of the ILC physics program 
and of the major challenges for the accelerator and detector design. 


2.1. Introduction 


Accelerator particle physics is completing a successful cycle of precision 
tests of the Standard Model of electro-weak interactions (SM). After the 
discovery of the W and Z bosons at the S'ppS hadron collider at CERN, the 
concurrent operation of hadron and ete” colliders has provided a large set 
of precision data and new observations. Two e*e™ colliders, the SLAC Lin- 
ear Collider (SLC) at the Stanford Linear Accelerator Center (SLAC) and 
the Large Electron Positron (LEP) collider at the European Organization 
for Nuclear Research (CERN), operated throughout the 1990’s and enabled 
the study of the properties of the Z boson in great detail. Operation at 
LEP up to 209 GeV, the highest collision energy ever achieved in electron- 
positron collisions, provided detailed information on the properties of W 
bosons and the strongest lower bounds on the mass of the Higgs boson and 
of several supersymmetric particles. The collision of point-like, elemen- 
tary particles at a well-defined and tunable energy offers advantages for 
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precision measurements, as those conducted at LEP and SLC, over proton 
colliders. On the other hand experiments at hadron machines, such as the 
Tevatron pp collider at Fermilab, have enjoyed higher constituent energies. 
The CDF and DO experiments eventually observed the direct production of 
top quarks, whose mass had been predicted on the basis of precision data 
obtained at LEP and SLC. 

While we await the commissioning and operation of the LHC pp collider 
at CERN, the next stage in experimentation at lepton colliders is actively 
under study. For more than two decades, studies for a high-luminosity 
accelerator, able to collide electrons with positrons at energies of the order 
of 1 TeV, are being carried out world-wide. 


2.2. The path towards the ILC 


The concept of an ete~ linear collider dates back to a paper by Maury 
Tigner! published in 1965, when the physics potential of e+e~ collisions had 
not yet been appreciated in full. This seminal paper envisaged collisions 
at 3-4 GeV with a luminosity competitive with that of the SPEAR ring 
at SLAC, i.e. 3 x 109° cm7? s~!. A possible scheme to obtain e~e~ and 
ete collisions at energies of hundreds of GeV is the title of a paper? by 
Ugo Amaldi published a decade later in 1976, which sketches the linear 


collider concept with a design close to that now developed for the ILC. The 
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Fig. 2.1. The linear collider layout as sketched in 1975 in one of the figures of Ref. 2. 
The paper discussed the possibility to achieve e~e~ and e+e7 collisions at 0.3 TeV using 
superconducting linacs with a gradient of 10 MV/m. 


parameters for a linear collider, clearly recognised as the successors of ebe7 
storage rings on the way to high energies, were discussed by Burt Richter 
at the IEEE conference in San Francisco in 1979? and soon after came the 
proposal for the Single Pass Collider Project which would become SLC at 
SLAC. 

From 1985, the CERN Long Range Planning Committee considered an 
ete” linear collider, based on the CLIC* design, able to deliver collisions 
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at 2 TeV with 1033 cm~? s7! luminosity, vis-a-vis a hadron collider, with 
proton-proton collisions at 16 TeV and luminosity of 1.4 x 1033 cm7? s71, 
as a candidate for the new CERN project after LEP. That review process 
eventually led to the decision to build the LHC, but it marked an important 
step to establish the potential of a high energy ete~ collider. It is important 
to note that it was through the contributions of several theorists, including 
John Ellis, Michael Peskin, Gordon Kane and others, that the requirements 
in terms of energy and luminosity for a linear collider became clearer in the 
mid 1980’s.° The SLC project gave an important proof of principle for 
a high energy linear collider and the experience gained has shaped the 
subsequent designs in quite a significant way. 

After a decade marked by important progress in the R&D of the basic 
components and the setup of advanced test facilities, designs of four dif- 
ferent concepts emerged: TESLA, based on superconducting RF cavities, 
the NLC/JLC-X, based on high frequency (11.4 GHz) room-temperature 
copper cavities, JLC-C, based on lower frequency (5.7 GHz) conventional 
cavities and CLIC, a multi-TeV collider based on a different beam accelera- 
tion technique, the two-beam scheme with transfer structures operating at 
30 GHz. Accelerator R&D had reached the maturity to assess the technical 
feasibility of a linear collider project and take an informed choice of the most 
advantageous RF technology. The designs were considered by the Interna- 
tional Linear Collider Technical Review Committee (ILC-TRC), originally 
formed in 1994 and re-convened by the International Committee for Future 
Accelerators (ICFA) in 2001 under the chairmanship of Greg A. Loew. The 
ILC-TRC assessed their status using common criteria, identified outstand- 
ing items needing R&D effort and suggested areas of collaboration. The 
TRC report was released in February 2003° and the committee found that 
there were no insurmountable show-stoppers to build TESLA, NLC/JLC-X 
or JLC-C in the next few years and CLIC in a more distant future, given 
enough resources. Nonetheless, significant R&D remained to be done. At 
this stage, it became clear that, to make further progress, the interna- 
tional effort towards a linear collider should be focused on a single design. 
ICFA gave mandate to an International Technology Recommendation Panel 
(ITRP), chaired by Barry Barish, to make a definite recommendation for a 
RF technology that would be the basis of a global project. In August 2004 
the ITRP made the recommendation in favour of superconducting RF cav- 
ities.” The technology choice, which was promptly accepted by all labo- 
ratories and groups involved in the R&D process, is regarded as a major 
step towards the realization of the linear collider project. Soon after it, a 
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truly world-wide, centrally managed design effort, the Global Design Effort 
(GDE),® a team of more than 60 persons, started, with the aim to produce 
an ILC Reference Design Report by beginning of 2007 and an ILC Tech- 
nical Design Report by end of 2008. The GDE responsibility now covers 
the detailed design concept, performance assessments, reliable international 
costing, industrialization plan, siting analysis, as well as detector concepts 
and scope. A further important step has been achieved with release of the 
Reference Design Report in February 2007.9 This report includes a pre- 
liminary value estimate of the cost for the ILC in its present design and at 
the present level of engineering and industrialisation. The value estimate 
is structured in three parts: 1.78 Billion ILC Value Units for site-related 
costs, such as those of tunneling in a specific region, 4.87 Billion ILC Value 
Units for the value of the high technology and conventional components and 
13,000 person-years for the required supporting manpower. For this esti- 
mate the conversion factor is 1 ILC Value Unit = 1 US Dollar = 0.83 Euro 
= 117 Yen. This estimate, which is comparable to the LHC cost, when the 
pre-existing facilities, such as the LEP tunnel, are included, provides guid- 
ance for optimisation of both the design and the R&D to be done during 
the engineering phase, due to start in Fall 2007. 

Technical progress was paralleled by increasing support for the ILC in 
the scientific community. At the 2001 APS workshop The Future of Physics 
held in Snowmass, CO, a consensus emerged for the ILC as the right project 
for the next large scale facility in particle physics. This consensus resonated 
and expanded in a number of statements by highly influential scientific advi- 
sory panels world-wide. The ILC role in the future of scientific research was 
recognised by the OECD Consultative Group on High Energy Physics,!° 
while the DOE Office of Science ranked the ILC as its top mid-term project. 
More recently the EPP 2010 panel of the US National Academy of Sciences, 
in a report titled Elementary Particle Physics in the 21% Century has en- 
dorsed the ILC as the next major experimental facility to be built and its 
role in elucidating the physics at the high energy frontier, independently 
from the LHC findings.‘! Nowadays, the ILC is broadly regarded as the 
highest priority for a future large facility in particle physics, needed to 
extend and complement the LHC discoveries with the accuracy which is 
crucial to understand the nature of New Physics, test fundamental prop- 
erties at the high energy scale and establish their relation to other fields 
in physical sciences, such as Cosmology. A matching program of physics 
studies and detector R&D efforts has been in place for the past decade and 
it is now developing new, accurate and cost effective detector designs from 
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proof of concepts towards that stage of engineering readiness, needed for 
being adopted in the ILC experiments. 


2.3. ILC Accelerator Parameters 
2.3.1. ILC Energy 


The first question which emerges in defining the ILC parameters is the 
required centre-ofmass energy ys. It is here where we most need physics 
guidance to define the next thresholds at, and beyond, the electro-weak 
scale. The only threshold which, at present, is well defined numerically 
is that of top-quark pair production at ys ~ 350 GeV. Beyond it, there 
is a strong prejudice, supported by precision electro-weak and other data, 
that the Higgs boson should be light and new physics thresholds may exist 
between the electro-weak scale and approximately 1 TeV. If indeed the SM 
Higgs boson exists and the electro-weak data is not affected by new physics, 
its mass My is expected to be below 200 GeV as discussed in section 2.4.1. 
Taking into account that the Higgs main production process is in association 
with a Z? boson, the maximum of the ete~ — H? Z? cross section varies 
from \/s = 240 GeV to 350 GeV for 120 GeV < My < 200 GeV. On the 
other hand, we know that the current SM needs to be extended by some New 
Physics. Models of electroweak symmetry breaking contain new particles 
in the energy domain below 1 TeV. More specifically, if Supersymmetry 
exists and it is responsible for the dark matter observed in the Universe, 
we expect that a significant fraction of the supersymmetric spectrum would 
be accessible at \/s = 0.5-1.0 TeV. In particular, the ILC should be able to 
study in detail those particles determining the dark matter relic density in 
the Universe by operating at energies not exceeding 1 TeV, as discussed in 
section 2.4.2. Another useful perspective on the ILC energy is an analysis 
of the mass scale sensitivity for new physics vs. the ys energy for lepton 
and hadron colliders in view of their synergy. The study of electro-weak 
processes at the highest available energy offers a window on mass scales 
well beyond its kinematic reach. A comparison of the mass-scale sensitivity 
for various new physics scenarios as a function of the centre-of-mass energy 


for et 


e and pp collisions is given in section 2.4.3. These and similar 
considerations, emerged in the course of the world-wide studies on physics 
at the ILC, motivate the choice of ys = 0.5 TeV as the reference energy 
parameter, but requiring the ILC to be able to operate, with substantial 
luminosity, at 0.3 TeV as well and to be upgradable up to approximately 


1 TeV. 
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It is useful to consider these energies in an historical perspective. In 1954 
Enrico Fermi gave a talk at the American Physical Society, of which he was 
chair, titled What can we learn with high energy accelerators ?. In that talk 
Fermi considered a proton accelerator with a radius equal to that of Earth 
and 2 T bending magnets, thus reaching a beam energy of 5 x 101° eV.” 
Stanley Livingstone, who had built with Ernest O. Lawrence the first cir- 
cular accelerator at Berkeley in 1930, had formulated an empirical linear 
scaling law for the available centre-of-mass energy vs. the construction year 
and cost. Using Livingstone curve, Fermi predicted that such an acceler- 
ator could be built in 1994 at a cost of 170 billion $. We have learned 
that, not only such accelerator could not be built, but accelerator physics 
has irrevocably fallen off the Livingstone curve, even in its revised version, 
which includes data up to the 1980’s. As horizons expanded, each step 
has involved more and more technical challenges and has required more 
resources. The future promises to be along this same path. This underlines 
the need of coherent and responsible long term planning while sustaining a 
rich R&D program in both accelerator and detector techniques. 

The accelerator envisaged by Enrico Fermi was a circular machine, as 
the almost totality of machines operating at the high energy frontier still 
are. Now, as it is well known, charged particles undergoing a centripetal 
acceleration a = v?/R radiate at rate P = gh oe If the radius R 
is kept constant, the energy loss is the above rate P times t = 27R/v, 
the time spent in the bending section of the accelerator. The energy loss 


for electrons is W = 8.85 x 19 EGev) MeV per turn while for protons 


is W = 7.8 x 19-3 Ge keV per turn. Since the energy transferred 
per turn by the RF cavities to the beam is constant, G x 2R x F, where 
G is the cavity gradient and F the tunnel fill factor, for each value of 
the accelerator ring radius R there exists a maximum energy Emax beyond 
which the energy loss exceeds the energy transferred. In practice, before this 
value of Emax is reached, the real energy limit is set by the power dumped 
by the beam as synchrotron radiation. To make a quantitative example, 
in the case of the LEP ring, with a radius R = 4.3 km, a beam of energy 
Eveam = 250 GeV, would lose 80 GeV/turn. Gunther Voss is thought to be 
the author of a plot comparing the guessed cost of a storage ring and a linear 
collider as a function of the e*e~ centre-of-mass energy. A \/s = 500 GeV 
storage ring, which would have costed an estimated 14 billions CHF in 
1970’s is aptly labelled as the Crazytron.!® LEP filled the last window of 
opportunity for a storage ring at the high energy frontier. Beyond LEP-2 
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energies the design must be a linear collider, where no bending is applied 
to the accelerated particles. Still the accelerator length is limited by a 
number of constraints which include costs, alignment and siting. Therefore, 
technology still defines the maximum reachable energy at the ILC. 

The ILC design is based on superconducting (s.c.) radio-frequency (RF) 
cavities. While s.c. cavities had been considered already in the 1960’s, it was 
Ugo Amaldi to first propose a fully s.c. linear collider in 1975.? By the early 
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Fig. 2.2. Distributions of gradients measured for pure niobium, nine-cell cavities. After 
electro-polishing an average gradient in excess to 35 MV/m has been obtained. 


1990’s, s.c. cavities equipped already one accelerator, TRISTAN at KEK 
in Japan, while two further projects were in progress, CEBAF at Cornell 
and the LEP-2 upgrade at CERN. LEP-2 employed a total of 288 s.c. RF 
cavities, providing an average gradient of 7.2 MV/m. It was the visionary 
effort of Bjorn Wijk to promote, from 1990, the TESLA collaboration, 
with the aim to develop s.c. RF cavities pushing the gradient higher by 
a factor of five and the production costs down by a factor of four, thus 
reducing the cost per MV by a factor of twenty. Such reduction in cost was 
absolutely necessary to make a high energy collider, based on s.c. cavities, 
feasible. Within less than a decade 1.3 GHz, pure niobium cavities achieved 
gradients in excess to 35 MV/m. This opened the way to their application 
to a ete" linear collider, able to reach centre-of-mass energies of the order 
of 1 TeV, as presented in detail in the TESLA proposal published in 200114 
and recommended for the ILC by the ITRP in 2004.” 

Today, the ILC baseline design aims at matching technical feasibility 
to cost optimisation. One of the major goals of the current effort in the 
ILC design is to understand enough about its costs to provide a reliable 
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Fig. 2.3. Schematic layout of the International Linear Collider. This diagram reflects 
the recommendations of the Baseline Configuration Document, a report published in 
December 2005 that outlines the general design of the machine. (Credit ILC Global 
Design Effort) 


indication of the scale of funding required to carry out the ILC project. 
Preparing a reliable cost estimate for a project to be carried out as a truly 
world-wide effort at the stage of a conceptual design that still lacks much 
of the detailed engineering designs as well as agreements for responsibility 
and cost sharing between the partners and a precise industrialisation plan 
is a great challenge. Still having good cost information as soon as possible, 
to initiate negotiations with the funding agencies is of great importance. 
An interesting example of the details entering in this process is the opti- 
misation of the cost vs. cavity gradient for a 0.5 TeV collider. The site 
length scales inversely with the gradient G while the cost of the cryogenics 
scales as G?/Qp resulting in a minimum cost for a gradient of 40 MW/m, 
corresponding to a tunnel length of 40 km, and a fractional cost increase 
of 10 % for gradients of 25 MV/m or 57 MV/m. The chosen gradient of 
35 MV/m, which is matched by the average performance of the most recent 
prototypes after electro-polishing, gives a total tunnel length of 44 km with 
a cost increment from the minimum of just 1 %. 

Beyond 1 TeV, the extension of conventional RF technology is more 
speculative. In order to attain collisions at energies in excess of about 
1 TeV, with high luminosity, significantly higher gradients are necessary. 
As the gradient of s.c. cavities is limited below ~ 50 MV/m, other av- 
enues should be explored. The CLIC technology,’ currently being de- 
veloped at CERN and elsewhere, may offer gradients of the order of 
150 MV/m,!° allowing collision energies in the range 3-5 TeV with a lu- 


minosity of 109° cm~? s~!, which would support a compelling physics 
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Fig. 2.4. Schematics of the overall layout of the CLIC complex for e+e~ collisions at 
Vs = 3 TeV. (from Ref. 17) 


program.!” While RF cavities are limited to accelerating fields of order 


of 100-200 MV/m, or below, laser-wakefield accelerators are capable, in 
principle, of producing fields of 10-100 GV/m. Recently a 1 GeV e~ beam 
has been accelerated over just 3.3 cm using a 40 TW peak-power laser 
pulse,!® thus opening a possible path towards ultra-high energies in e+e7 
collisions in some more distant future. 


2.3.2. ILC Luminosity 


The choice of a linear collider, rather than a circular storage ring, while 
solving the problem of the maximum reachable energy, introduces the chal- 
lenge of achieving collisions with the required luminosity. The luminos- 
ity, £, defined as the proportionality factor between the number of events 
produced and the process cross section g, has requirements which de- 
pend on the typical values of s-channel cross sections and so scale as 1/s. 
First luminosity requirements were already outlined in the 1980s!%?° as 
LY 2Poeam x 10°3 cm7? s~!, based on the estimated discovery potential. 
But in the present vision of the ILC role in probing the high energy frontier 
new requirements must be considered. One example is the precision study 
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of electro-weak processes to look for deviations from the SM predictions, 
due to effect of new physics at high scales. The e+e~ — bb cross section 
at 1 TeV is just 96 fb, so this would corresponds to less than 10? events 
per year at 1033 cm~? s~!, which is certainly insufficient for the kind of 
precision measurements which we expect from the ILC. Another example 
is offered by one of the reactions most unique to the ILC: the double-Higgs 
production ete~ — HHZ sensitive to the Higgs self-coupling, which has a 
cross section of order of only 0.2 fb at 0.5 TeV. Therefore a luminosity of 
10°4 cm7? s~! or more is required as baseline parameter. 

The luminosity can be expressed as a function of the accelerator param- 
eters as: 

2 
L= famy (2.1) 

Now, since in a linear machine the beams are collided only once and then 
dumped, the collision frequency, frep, is small and high luminosity should 
be achieved by increasing the number of particles in a bunch N, the num- 
ber of bunches n, and decreasing the transverse beam size ø. Viable values 
for N are limited by wake-field effects and the ILC parameters have the 
same number of electrons in a bunch as LEP had, though it aims at a lu- 
minosity three orders of magnitude higher. Therefore, the increase must 
come from a larger number of bunches and a smaller transverse beam size. 
The generation of beams of small transverse size, their preservation dur- 
ing acceleration and their focusing to spots of nanometer size at the in- 
teraction region presents powerful challenges which the ILC design must 
solve. A small beam size also induces beam-beam interactions. On one 
hand the beam self-focusing, due to the electrostatic attraction of particles 
of opposite charges enhances the luminosity. But beam-beam interactions 
also result in an increase of beamstrahlung with a larger energy spread of 
the colliding particles, a degraded luminosity spectrum and higher back- 
grounds. Beamstrahlung is energy loss due to particle radiation triggered 
by the trajectory bending in the interactions with the charged particles in 
the incoming bunch.?! The mean beamstrahlung energy loss, which has to 
be minimised, is given by: 


er? Eom N, Z 


Ops ~ 0.86 —. 
Be 2moc? Oz (dz +y)? 


(2.2) 


Since the luminosity scales as —, while the beamstrahlung energy loss 
zy 


scales as z it is advantageous to choose a large beam aspect ratio, 
x Y 
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with the vertical beam size much smaller than the horizontal component. 
The parameter optimisation for luminosity can be further understood by 
expressing the luminosity in terms of beam power P = frepN Eem = Pac 
and beamstrahlung energy loss as: 


P. ô 
poed AC /°BS 


Hp (2.3) 
cm Ey 

which highlights the dependence on the cavity efficiency 7 and the total 
power Pac. The Hp term is the pinch enhancement factor, that accounts 
for the bunch attraction in the collisions of oppositely charged beams. In 
summary, since the amount of available power is necessarily limited, the 
main handles on luminosity are 7 and ey. The efficiency for transferring 
power from the plug to the beam is naturally higher for s.c. than for conven- 
tional copper cavities, so more relaxed collision parameters can be adopted 
for a s.c. linear collider delivering the same luminosity. The main beam 
parameters for the ILC baseline design are given in Table 2.1. 


Table 2.1. ILC baseline design beam parameters. 


Parameter Vs Vs 
0.5 TeV 1.0 TeV 


Luminosity L (1034 em~?s~!) 2.0 2.8 
Frequency (Hz) 5.0 5.0 

Nb. of particles (101°) 2.0 2.0 
Nb. of bunches Nz 2820 2820 
Bunch spacing (ns) 308 308 
Vertical beam size oy (nm) 5.7 3.5 
Beamstrahlung Parameter pgs 0.022 0.050 
Hp 1.7 1.5 


2.4. ILC Physics Highlights 


The ILC physics program, as we can anticipate it at present, is broad and 
diverse, compelling and challenging. The ILC is being designed for oper- 
ation at 0.5 TeV with the potential to span the largest range of collision 
energies, from the Z° peak at 0.091 TeV up to 1 TeV, collide electrons with 
positrons, but optionally also electrons with electrons, photons with pho- 
tons and photons with electrons, and combine various polarization states 
of the electron and positron beams. Various reports discussing the linear 
collider physics case, including results of detailed physics studies, have been 
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published in the last few years.°!%?2-26 Here, I shall focus on three of the 
main ILC physics themes: the detailed study of the Higgs boson profile, the 
determination of neutralino dark matter density in the Universe from ac- 
celerator data, and the sensitivity to new phenomena beyond the ILC kine- 
matic reach, through the analysis of two-fermion production, at the highest 
ys energy. Results discussed in the following have been obtained mostly 
using realistic, yet parametric simulation of the detector response. Only 
few analyses have been carried out which include the full set of physics and 
machine-induced backgrounds on fully simulated and reconstructed events. 
With the progress of the activities of detector concepts and the definition 
of well-defined benchmark processes, this is becoming one of the priorities 
for the continuation of physics and detector studies. 


2.4.1. The Higgs Profile at the ILC 


Explaining the origin of mass is one of the great scientific quests of our 
time. The SM addresses this question by the Higgs mechanism.?” The 
first direct manifestation of the Higgs mechanism through the Higgs sector 
will be the existence of at least one Higgs boson. The observation of a 
new spin-0 particle would represent a first sign that the Higgs mechanism 
of mass generation is indeed realised in Nature. This has motivated a 
large experimental effort, from LEP-2 to the Tevatron and, soon, the LHC, 
actively backed-up by new and more accurate theoretical predictions. After 
a Higgs discovery, which we anticipate will be possible at the LHC, full 
validation of the Higgs mechanism can only be established by an accurate 
study of the Higgs boson production and decay properties. It is here where 
the ILC potential in precision physics will be crucial for the validation of 
the Higgs mechanism, through a detailed study of the Higgs profile.31 

The details of this study depend on the Higgs boson mass, My. In the 
SM, My = V2Xv where the Higgs field expectation value v is determined 
as (2G r)" 2 = 246 GeV, while the Higgs self-coupling AÀ is not specified, 
leaving the mass as a free parameter. However, we have strong indications 
that My must be light. The Higgs self-coupling behaviour at high ener- 
gies,” the Higgs field contribution to precision electro-weak data®° and the 
results of direct searches at LEP-2?° at \/s > 206 GeV, all point towards a 
light Higgs boson. In particular, the study of precision electro-weak data, 
which are sensitive to the Higgs mass logarithmic contribution to radiative 
corrections, is based on several independent observables, including masses 
(Miop, Mw, Mz), lepton and quark asymmetries at the Z° pole, Z? line- 
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shape and partial decay widths. The fit to eighteen observables results in 
a 95% C.L. upper limit for the Higgs mass of 166 GeV, which becomes 
199 GeV when the lower limit from the direct searches at LEP-2, My > 
114.4 GeV, is included. As a result, current data indicates that the Higgs 
boson mass should be in the range 114 GeV < My < 199 GeV. It is encour- 
aging to observe that if the same fit is repeated, but excluding this time mop 
or My, the results for their values, 1781}? GeV and 80.361+0.020 GeV re- 
spectively, are in very good agreement with the those obtained the direct 
determinations, Mop = 171.442.1 GeV and My = 80.392+0.029 GeV. 

At the ILC the Higgs boson can be observed in the Higgs-strahlung 
production process ete~ — HZ with Z — €*+£—, independent of its decay 
mode, by the distinctive peak in the di-lepton recoil mass distribution. A 
data set of 500 fb™t at \/s = 350 GeV, corresponding to four years of ILC 
running, provides a sample of 3500-2200 Higgs particles produced in the di- 
lepton HZ channel, for My = 120-200 GeV. Taking into account the SM 
backgrounds, dominated by ete~ — Z°Z° and WtW- production, the 
Higgs boson observability is guaranteed up to its production kinematical 
limit, independent of its decays. This sets the ILC aside from the LHC, 
since the ILC sensitivity to the Higgs boson does not depend on its detailed 
properties. 
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Fig. 2.5. The Higgs-strahlung process at the ILC. (a) ete — HZ cross section vs. 
My for ys = 0.35 TeV and 0.5 TeV, (b) reconstructed p+~ recoil mass for various 
values of the Higgs boson mass (Credit: ALCPG Study Group). 


62 Marco Battaglia 


After observation of a new particle with properties compatible with 
those of the Higgs boson, a significant experimental and theoretical effort 
will be needed to verify that this is indeed the boson of the scalar field 
responsible for the electro-weak symmetry breaking and the generation of 
mass. Outlining the Higgs boson profile, through the determination of its 
mass, width, quantum numbers, couplings to gauge bosons and fermions 
and the reconstruction of the Higgs potential, stands as a most challenging, 
yet compelling, physics program. The ILC, with its large data sets at dif- 
ferent centre-of-mass energies and beam polarisation conditions, the high 
resolution detectors providing unprecedented accuracy on the reconstruc- 
tion of the event properties and the use of advanced analysis techniques, 
developed from those successfully adopted at LEP and SLC, promises to 
promote Higgs physics into the domain of precision measurements. Since 
the Higgs mass My is not predicted by theory, it is of great interest to 
measure it precisely. Once this mass, and thus A, is fixed, the profile of 
the Higgs particle is uniquely determined in the SM. In most scenarios we 
expect the LHC to determine the Higgs mass with a good accuracy. At the 
ILC, this measurement can be refined by exploiting the kinematical char- 
acteristics of the Higgs-strahlung production process ete~ — Z* > H? Z? 
where the Z° can be reconstructed in both its leptonic and hadronic decay 
modes. The 4+4- recoil mass for leptonic Z° decays yields an accuracy of 
110 MeV for 500 fb~! of data, without any requirement on the nature of 
the Higgs decays. Further improvement can be obtained by explicitly se- 
lecting H — bb (WW) for My <(>) 140 GeV. Here a kinematical 5-C fit, 
imposing energy and momentum conservation and the mass of a jet pair to 
correspond to Mz, achieves an accuracy of 40 to 90 MeV for 120< My < 
180 GeV.*? 

The total decay width of the Higgs boson is predicted to be too narrow 
to be resolved experimentally for Higgs boson masses below the ZZ thresh- 
old. On the contrary, above ~ 200 GeV, the total width can be measured 
directly from the reconstructed width of the recoil mass peak, as discussed 
below. For the lower mass range, indirect methods must be applied. In 
general, the total width is given by Tyo: = Tx /BR(H — X). Whenever [x 
can be determined independently of the corresponding branching fraction, 
a measurement of Tiot can be carried out. The most convenient choice is the 
extraction of I y from the measurements of the WW fusion cross section 
and the H — WW* decay branching fraction . A relative precision of 6% 
to 13% on the width of the Higgs boson can be obtained at the ILC with 
this technique, for masses between 120 GeV and 160 GeV. The spin, parity 
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Fig. 2.6. Determination of the Higgs boson spin from a scan of the et+e~ — HZ cross 
section at threshold at the ILC (from Ref. 23). 


and charge-conjugation quantum numbers JPO of Higgs bosons can be de- 
termined at the ILC in a model-independent way. Already the observation 
of either yy — H production or H — yy decay sets J # 1 and C = +. 
The angular dependence Wau x sin? @ and the rise of the Higgs-strahlung 
cross section: 


OzHxX Bw s—(My+ Mz)? (2.4) 


allows to determine J’ = 0+ and distinguish the SM Higgs from a C P-odd 
0-* state A°, or a CP-violating mixture of the two.3*°4 But where the 
ILC has a most unique potential is in verifying that the Higgs boson does its 
job of providing gauge bosons, quarks and leptons with their masses. This 
requires to precisely test the relation gyxx x mx between the Yukawa 
couplings, gyxx, and the corresponding particle masses, mx. In fact, the 
SM Higgs couplings to fermion pairs gyff = m/v are fully determined by 
the fermion mass mp. The corresponding decay partial widths only depend 
on these couplings and on the Higgs boson mass, QCD corrections do not 
represent a significant source of uncertainty.” Therefore, their accurate 
determination will represent a comprehensive test of the Higgs mechanism 
of mass generation.’ Further, observing deviations of the measured values 
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Fig. 2.7. Particle couplings to the Higgs field, for a 120 GeV boson, as a function of the 
particle masses. The error bars show the expected ILC accuracy in determining each of 
the couplings. The dark line is the SM prediction, while the shaded gray area shows the 
range of predictions from theories of new physics beyond the SM with extra dimensions 
(Credit: ACFA ILC Study Group). 


from the SM predictions will probe the structure of the Higgs sector and 
may reveal a non-minimal implementation of the Higgs model or the effect 
of new physics inducing a shift of the Higgs couplings.?” 39 The accuracy 
of these measurements relies on the performances of jet flavour tagging 
and thus mostly on the Vertex Tracker, making this analysis an important 
benchmark for optimising the detector design. It is important to ensure 
that the ILC sensitivity extends over a wide range of Higgs boson masses 
and that a significant accuracy is achieved for most particle species. Here, 
the ILC adds the precision which establishes the key elements of the Higgs 
mechanism. It is important to point out that these tests are becoming 
more stringent now that the B-factories have greatly improved the deter- 
mination of the b- and c- quark masses. When one of these studies was first 
presented in 1999,40 the b quark mass was known to +0.11 GeV and the 
charm mass to +0.13 GeV, with the expectation that ete B-factory and 
LHC data could reduce these uncertainties by a factor of two by the time 
the ILC data would be analysed. Today, the analysis of a fraction of the 
BaBar data“! has already brought these uncertainties down to 0.07 GeV for 
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mp and, more importantly, 0.09 GeV for me, using the spectral moments 
technique in semi-leptonic B decays, which had been pioneered on CLEO*? 
and DELPHI data. Extrapolating to the anticipated total statistics to 
be collected at PEP-II and KEKB, we can now confidently expect that the 
b quark mass should be known to better than +0.05 GeV and the charm 
mass to better than +0.06 GeV. This translates into less than +0.4 % and 
+6.5 % relative uncertainty in computing the Higgs SM couplings to b and 
c quarks, respectively, and motivates enhanced experimental precision in 
the determination of these couplings at the ILC. Detailed simulation shows 
that these accuracy can be matched by the ILC.**4” 

While much of the emphasis on the ILC capabilities in the study of the 
Higgs profile is for a light Higgs scenario, preferred by the current electro- 
weak data and richer in decay modes, the ILC has also the potential of 
precisely mapping out the Higgs boson properties for heavier masses. If the 
Higgs boson turns out to weigh of order 200 GeV, the 95% C.L. upper limit 
indicated by electro-weak fits, or even heavier, the analysis of the recoil mass 
in ete — HZ at /s = 0.5 TeV allows to precisely determine My, [7 and 
the Higgs-strahlung cross section. Even for My = 240 GeV, the mass can 
be determined to a 107? accuracy and, more importantly, the total width 
measured about 10% accuracy. Decays of Higgs bosons produced in ete~ > 
Hvv give access to the Higgs couplings. The importance of the WW- 
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Fig. 2.8. H — bb signal after full event selection at the ILC for (a) My = 120 GeV 
and (b) My = 200 GeV (from Ref. 47). 


fusion process ebe~ — H° vi to probe rare Higgs decays at higher energies, 
emerged in the physics study for a multi-TeV linear collider.*° Since this 


66 Marco Battaglia 


cross section increases as log zyz , it becomes dominant around ys = 1 TeV. 
fet 


Detailed studies have been performed and show that 1 ab~! of data at 
\/s = 1 TeV, corresponding to three to four years of ILC running, can 
significantly improve the determination of the Higgs couplings, especially 
for the larger values of My.46*” WW and ZZ couplings can be determined 
with relative accuracies of 3 % and 5 % respectively, while the coupling to 
bb pairs, a rare decay with a branching fraction of just 2 x 107° at such 
large masses, can be determined to 4 % to 14 % for 180 GeV < My < 
220 GeV. This measurement is of great importance, since it would offer the 
only opportunity to learn about the fermion couplings of such an heavy 
Higgs boson, and it is unique to a linear collider. 

A most distinctive feature of the Higgs mechanism is the shape of the 
Higgs potential: 


Basis 
V() = -E0 +3% (2.5) 


with v = fe. In the SM, the triple Higgs coupling, ggyy = 3Av, is 
related to the Higgs mass, My, through the relation 
3 M3 


JHHH = 57° (2.6) 


By determining gy, the above relation can be tested. The ILC has 
access to the triple Higgs coupling through the double Higgs production 
processes ete~ — HHZ and ete~ — HHvv.*® Deviations from the SM 
relation for the strength of the Higgs self-coupling arise in models with 
an extended Higgs sector.4? The extraction of gy is made difficult by 
their tiny cross sections and by the dilution effect, due to diagrams leading 
to the same double Higgs final states, but not sensitive to the triple Higgs 
vertex. This makes the determination of gy yg a genuine experimental tour 
de force. Other modes, such as ete~ — HHbb, have also been recently 
proposed”? but signal yields are too small to provide any precise data. 
Operating at ,/s = 0.5 TeV the ILC can measure the HHZ production 
cross section to about 15% accuracy, if the Higgs boson mass is 120 GeV, 
corresponding to a fractional accuracy on gygy of 23%.°! Improvements 
can be obtained first by introducing observables sensitive to the presence 
of the triple Higgs vertex and then by performing the analysis at higher 
energies where the HHvv channel contributes.°? In the HHZ process 
events from diagrams containing the HH H vertex exhibit a lower invariant 
mass of the HH system compared to double-Higgstrahlung events. When 
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Fig. 2.9. Invariant mass of the HH system in ete~ — HHZ events reconstructed with 
1 ab~! of data at 0.8 TeV. The histograms show the predicted distribution for various 
values of gy gg demonstrating that the low mass region is sensitive to the contribution 
of the triple Higgs vertex. 


the Myy spectrum is fitted, a relative statistical accuracy of +0.20 can be 
obtained with 1 ab™} at \/s = 0.5 TeV. The availability of beam polarization 
increases the HH Z cross section by a factor of two and that for H Hv? by a 
factor of four, thus offering a further possible significant improvement to the 
final accuracy. The ILC and, possibly, a multi-TeV ete~ collider represent 
a unique opportunity for carrying out this fundamental measurement. In 
fact, preliminary studies show that, the analysis of double Higgs production 
at the LHC is only possible after a luminosity upgrade and, even then, 
beyond the observation of double Higgs production, it would provide only 


a very limited information on the triple-Higgs coupling.5354 


2.4.2. Understanding Dark Matter at the ILC 


The search for new physics beyond the Standard Model has a central role 
in the science program of future colliders. It is instructive to contrast the 
LHC and the ILC in terms of their potential in such searches. Running 
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Table 2.2. Summary of the accuracies on the 
determination of the Higgs boson profile at the 
ILC. Results are given for a 350-500 GeV ILC 
with £=0.5 ab~!. Further improvements, ex- 
pected from a 1 TeV ILC are also shown for some 
of the measurements. 


My 5(X)/X 


(GeV) ILC-500 | ILC-1000 
0.5 ab-! | 1 ab™t 


5My/M 120-180 (3-5) x1074 
Ttot/T 120-200 0.03- — | 0.03 - 0.05 
S6g9Hww/g 120-240 0.01-0.03 | 0.01 - 0.01 
bgHzz/9 20-240 0.01-0.05 

Sg utt/9 120-200 0.02- — | 0.06 - 0.13 
bg 16b/9 120-200 0.01-0.06 | 0.01 - 0.05 
6911 ce/9 120-140 0.06-0.12 
bgHrr/9 120-140 0.03-0.05 
69Hpp/9 120-140 0.15 | 0.04-0.06 
CP test 120 0.03 
ÔJHHH/9 120 0.20 | 0.1 


at \/s < 1 TeV the ILC might appear to be limited in reach, somewhere 
within the energy domain being probed by the Tevatron and that to be 
accessed by LHC. And yet its potential for fully understanding the new 
physics, which the LHC might have manifested, and for probing the high 
energy frontier beyond the boundaries explored in hadron collisions is of 
paramount importance. There are several examples of how the ILC will be 
essential for understanding new physics. They address scenarios where sig- 
nals of physics beyond the SM, as observed at the LHC, may be insufficient 
to decide on the nature of the new phenomena. One such example, which 
has been studied in some details, is the case of Supersymmetry and Uni- 
versal Extra Dimensions (UED), two very different models of new physics 
leading to the very same experimental signature: fermion pairs plus missing 
energy. Here, the limited analytical power of the LHC may leave us unde- 
cided,56:57 while a single spin measurement performed at the ILC precisely 
identifies the nature of the observed particles.58 But the ILC capability 
to fully understand the implications of new physics, through fundamental 
measurements performed with high accuracy, is manifested also in scenar- 
ios where the LHC could observe a significant fraction of the new particle 
spectrum. An especially compelling example, which can be studied quan- 
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titatively, is offered by Supersymmetry in relation to Dark Matter (DM). 
Dark Matter has been established as a major component of the Universe. 
We know from several independent observations, including the cosmic mi- 
crowave background (CMB), supernovas (SNs) and galaxy clusters, that 
DM is responsible for approximately 20 % of the energy density of the 
universe. Yet, none of the SM particles can be responsible for it and the 
observation of DM is likely the first direct signal of new physics beyond 
the SM. Several particles and objects have been nominated as candidates 
for DM. They span a wide range of masses, from 10~° eV, in the case of 
axions, to 10~° solar masses, for primordial black holes. Cosmology tells 
us that a significant fraction of the Universe mass consists of DM, but does 
not provide clues on its nature. Particle physics tells us that New Physics 
must exist at, or just beyond, the EW scale and new symmetries may result 
in new, stable particles. Establishing the inter-relations between physics at 
the microscopic scale and phenomena at cosmological scale will represent a 
major theme for physics in the next decades. The ILC will be able to play a 
key role in elucidating these inter-relations. Out of these many possibilities, 
there is a class of models which is especially attractive since its existence 
is independently motivated and DM, at about the observed density, arises 
naturally. These are extensions of the SM, which include an extra sym- 
metry protecting the lightest particle in the new sector from decaying into 
ordinary SM states. The lightest particle becomes stable and can be chosen 
to be neutral. Such a particle is called a weakly interacting massive parti- 
cle (WIMP) and arises in Supersymmetry with conserved R-parity (SUSY) 
but also in Extra Dimensions with KK-parity (UED).® Current cosmolog- 
ical data, mostly through the WMAP satellite measurements of the CMB, 
determine the DM density in the Universe with a 6 % relative accuracy.°? 
By the next decade, the PLANCK satellite will push this uncertainty to 
~ 1%, or below.© Additional astrophysical data manifest a possible ev- 
idence of DM annihilation. The EGRET data show excess of y emission 
in the inner galaxy, which has been interpreted as due to DM®! and the 
WMAP data itself may show a signal of synchrotron emission in the Galac- 
tic center.°? These data, if confirmed, may be used to further constrain 
the DM properties. Ground-based DM searches are also approaching the 
stage where their sensitivity is at the level predicted by Supersymmetry 
for some combinations of parameters.®? The next decades promise to be 
a time when accelerator experiments will provide new breakthroughs and 
highly accurate data to gain new insights, not only on fundamental ques- 
tions in particle physics, but also in cosmology, when studied alongside the 
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observations from satellites and other experiments. The questions on the 
nature and the origin of DM offer a prime example of the synergies of new 
experiments at hadron and lepton colliders, at satellites and ground-based 
DM experiments. 

It is essential to study, in well defined, yet general enough, models, which 
are the properties of the new physics sector, such as masses and couplings, 
most important to determine the resulting relic density of the DM particles. 
Models exist which allow to link the microscopic particle properties to the 
present DM density in the Universe, with mild assumptions. If DM consists 
of WIMPs, they are abundantly produced in the very early Universe when 
T œ (t(sec))~!/? > 100 GeV and their interaction cross section is large 
enough that they were in thermal equilibrium for some period in the early 
universe. The DM relic density can be determined by solving the Boltzmann 
equation governing the evolution of their phase space number density. It 
can be shown that, by taking the WMAP result for the DM relic density 
in units of the Universe critical density,Q pmh?, the thermal averaged DM 
annihilation cross section times the co-moving velocity, < ov >, should be 
~ 0.9. From this result, the mass of the DM candidate can be estimated 


as: 
| ta? 
Mpm = \| ————- ~ 100 GeV. (2.7) 
8 < ov > 


A particle with mass M = O(100 GeV) and weak cross section would 
naturally give the measured DM density. It is quite suggestive that new 
physics, responsible for the breaking of electro-weak symmetry, also intro- 
duce a WIMP of about that mass. In fact, in essentially every model of 
electroweak symmetry breaking, it is possible to add a discrete symmetry 
that makes the lightest new particle stable. Often, this discrete symmetry 
is required for other reasons. For example, in Supersymmetry, the con- 
served R parity is needed to eliminate rapid proton decay. In other cases, 
such as models with TeV-scale extra dimensions, the discrete symmetry is 
a natural consequence of the underlying geometry. 

Data on DM density already set rather stringent constraints on the pa- 
rameters of Supersymmetry, if the lightest neutralino x? is indeed respon- 
sible for saturating the amount of DM observed in the Universe. It is useful 
to discuss the different scenarios, where neutralino DM density is compat- 
ible with the WMAP result, in terms of parameter choices in the context 
of the constrained MSSM (cMSSM), to understand how the measurements 
that the ILC provides can establish the relation between new physics and 
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DM. The cMSSM reduces the number of free parameters to just five: the 
common scalar mass, Mo, the common gaugino mass, mM/2, the ratio of the 
vacuum expectation values of the two Higgs fields, tan 8, the sign of the 
Higgsino mass parameter, u, and the common trilinear coupling, Ag. It is 
a remarkable feature of this model that, as these parameters, defined at 
the unification scale, are evolved down to lower energies, the electroweak 


symmetry is broken spontaneously and masses for the W and Z° bosons 
generated automatically. As this model is simple and defined by a small 
number of parameters, it is well suited for phenomenological studies. The 
cosmologically interesting regions in the mo - Mıj2 parameter plane are 
shown in Figure 2.10. As we move away from the bulk region, at small val- 
ues of mo and m1/2, which is already severely constrained by LEP-2 data, 
the masses of supersymmetric particles increase and so does the dark mat- 
ter density. It is therefore necessary to have an annihilation process, which 
could efficiently remove neutralinos in the early universe, to restore the DM 
density to the value measured by WMAP. Different processes define three 
main regions: i) the focus point region, where the x} contains an admixture 
of the supersymmetric partner of a neutral Higgs boson and annihilates to 
WtW- and Z°Z°, ii) the co-annihilation region, where the lightest slepton 
has a mass very close to Myo, iii) the A annihilation funnel, where M (x?) 
is approximately half that of the heavy A° Higgs boson, providing efficient 
s-channel annihilation, xx — A. In each of these regions, researchers at the 
ILC will be confronted with several different measurements and significantly 
different event signatures. 


Table 2.3. cMSSM parameters of benchmark points. 


Point mo mız tanB Ag Sgn(u) M(t) 


LCC1 100 250 10 -100 + 178 
LCC2 3280 300 10 0 + 175 
LCC3 210 360 40 0 + 178 
LCC4 380 420 53 0 + 178 


It is interesting to observe that the DM constraint, reduces the dimen- 
sionality of the cMSSM plane, by one unit, since the allowed regions are 
tiny lines in the mo - ™ 4/2 plane, evolve with tan @ and depend only very 
weakly on Ao. Representative benchmark points have been defined and 
their parameters are summarised in Table 2.3. Even though these points 
have been defined in a specific supersymmetric model, their phenomenol- 
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Fig. 2.10. The DM-favoured regions in the mo - m1/2 plane of the cMSSM and existing 
constraints. The precise locations of these regions vary with the tan 8 parameter and 
therefore the axis are given without units. The indicative locations of the four benchmark 
points adopted, are also given. Lower limits on the Higgs boson mass and, in a portion of 
the parameter space, the measurement of the b — sy decay branching fraction, exclude 
the region at low values of m1/2. A discrepancy of the measured anomalous magnetic 
moment of the muon value with the SM prediction would favour the region on the left 
of the curve labeled g — 2. 


ogy is common to the more general supersymmetric solutions and we shall 
soon discuss the extension of results derived in this constrained model to 
the general MSSM. There are several features which are common to all 
these regions. First, the relic density depends on the mass of the lightest 
neutralino and of few additional particles, close in mass to it. The heavier 
part of the SUSY spectrum decouples from the value of Qyh?. This is of 
particular importance for the ILC. Running at ys < 1 TeV, the ILC will 
not be able to study supersymmetric particles exceeding ~450-490 GeV, 
in particular scalar quarks and heavy Higgs bosons in some regions of the 
parameter phase space. But, independently of the LHC results, the ILC 
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will either observe and measure these particles if they may be relevant to 
determine the relic DM density, or it will set bounds that ensure their 
decoupling. A second important observation is that Q h? typically de- 
pends on SUSY parameters which can be fixed by accurate measurements 
of particle masses, particle mass splittings, decay branching fractions and 
production cross sections. In some instances the availability of polarised 
beams is advantageous. The LHC can often make precise measurements of 
some particles, but it is difficult for the LHC experiments to assemble the 
complete set of parameters needed to reconstruct annihilation cross section. 
It is also typical of supersymmetry spectra to contain light particles that 
may be very difficult to observe in the hadron collider environment. The 
ILC, in contrast, provides just the right setting to obtain both types of 
measurements. Again, it is not necessary for the ILC to match the energy 
of the LHC, only that it provides enough energy to see the lightest charged 
particles of the new sector. 

Rather detailed ILC analyses of the relevant channels for each bench- 
mark point have been performed,®”” based on parametric simulation, 
which includes realistic detector performances and effects of the ILC beam 
characteristics. It has been assumed that the ILC will be able to provide 
collisions at centre-of-mass energies from 0.3 TeV to 0.5 TeV with an in- 
tegrated luminosity of 500 fb~! in a first phase of operation and then its 
collision energy can raised to 1 TeV to provide an additional data set of 
1 ab~!, corresponding to an additional three to four years of running. Re- 
sults are summarised in terms of the estimated accuracies on masses and 
mass differences in Table 2.4. 

In order to estimate the implications of these ILC measurements on 
the estimation of neutralino dark DM density 0.h?, broad scans of the 
multi-parameter supersymmetric phase space need to be performed. For 
each benchmark point, the soft parameters (masses and couplings) at the 
electroweak scale can be computed with the full 2-loop renormalization 


9.7 Super- 


group equations and threshold corrections using Isajet 7.6 
symmetric loop corrections to the Yukawa couplings can also be included. 
The electroweak-scale MSSM parameters are extracted from the high scale 
cMSSM parameters. The dark matter density Qh? can be estimated using 
the DarkSUSY” and Micromegas”? programs. These programs use the same 
Isajet code to determine the particle spectrum and couplings, including 
the running Yukawa couplings, and compute the thermally averaged cross 
section for neutralino annihilation, including co-annihilation and solve the 


equation describing the evolution of the number density for the DM candi- 
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Table 2.4. Summary of the accuracies (in GeV) on the main 
mass determinations by the ILC at 0.5 TeV for the four 
benchmark points. Results in [] brackets also include ILC 
data at 1 TeV. 


5M (x9) 0.05 1.0 0.1 [+ 1.4] 
5M (Er) + 0.05 - [+ 1.0] [+ 0.6] 
56M (71) + 0.3 = 0.5 0.9 
5M (72) +11 - - - 
5(M(ur)— M(X?) 0.2 [0.2] 0.6 
ô(M (71) — M(x9)) 0.3 - 1.0 1.0 
6(M (#2) — M(x) +11 [+ 3.0] 

5(M (x3) — M(X?) 0.07 0.3 0.6 [+ 1.8] 
(M (X5) — M(x) 4.0 0.2 [£20] [+ 2.0] 
5(M (xt) — M(x9)) 0.6 0.25 [40.7 2.0 
5(M(xq)- M(xT)) [+30] - = [42.0] 2.0 
5M(A°) [+ 1.5] z [+ 0.8] [+ 0.8] 
ôrT(A?) È [+ 1.2) [41.2] 


date. While the assumptions of the cMSSM are quite helpful for defining 
a set of benchmark points, the cMSSM is not representative of the generic 
MSSM, since it implies several mass relations, and its assumptions have no 
strong physics justification. Therefore, in studying the accuracy on Qyh?, 
the full set of MSSM parameters must be scanned in an uncorrelated way 
and the mass spectrum evaluated for each parameter set. A detailed study 
has recently been performed.”4 I summarise here some of the findings, 
Table 2.5 gives results for the neutralino relic density estimates in MSSM 
for the LHC, the ILC at 0.5 TeV and the ILC at 1 TeV. 

The LCC1 point is in the bulk region and the model contains light 
sleptons, with masses just above that of the lightest neutralino. The most 


Table 2.5. Summary of the relative accuracy 
2 
Sixt for the four benchmark points obtained with 


full SUSY scans. 


Benchmark Qh? LHC ILC ILC 
Point 0.5 TeV 1.0 TeV 
LCC1 0.192 0.072 0.018 0.024 
LCC2 0.109 0.820 0.140 0.076 
LCC3 0.101 1.670 0.500 0.180 


LCC4 0.114 4.050 0.850 0.190 
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Fig. 2.11. Mass reconstruction at ILC: (a) momentum endpoint in ñ > ux? (from 
Ref. 76) and (b) threshold scan for ete~ AtA (from Ref. 23) 


important annihilation reactions are those with t-channel slepton exchange. 
At the LHC, many of the SUSY spectrum parameters can be determined 
from kinematic constraints. At the ILC masses can be determined both by 
the two-body decay kinematics of the pair-produced SUSY particles and 
by dedicated threshold scans. Let us consider the two body decay of a 
scalar quark ¢ — qx?. If the scalar quarks are pair produced e*e™ — ĝĝ, 
Eq = Eveam and the x? escapes undetected, only the q (and the g) are 
observed in the detector. In a 1994 paper, J. Feng and D. Finnell” pointed 
out that the minimum and maximum energy of production for the quark 
can be related to the mass difference between the scalar quark g and the 


xi 


oe Ebveam L me my 
Eman, min = 2 (1 £ 1— EZ ) (1 = TA (2.8) 


The method can also be extended to slepton decays i= x9, which 
share the same topology, and allows to determine slepton mass once that 
of the neutralino is known or determine a relation between the masses 
and get mo if that of the slepton can be independently measured. The 
measurement requires a precise determination of the endpoint energies of 
the lepton momentum spectrum, Emin and Emax. It can be shown that 
accuracy is limited by beamstrahlung, affecting the knowledge of Ebeam in 
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the equation above, more than by the finite momentum resolution, dp/p of 
the detector. The ILC has a second, and even more precise, method for mass 
measurements. The possibility to precisely tune the collision energy allows 
to perform scans of the onset of the cross section for a specific SUSY particle 
pair production process. The particle mass and width can be extracted from 
a fit to the signal event yield as function of \/s. The accuracy depends rather 
weakly on the number of points, N, adopted in the scan and it appears that 
concentrating the total luminosity at two or three different energies close to 
the threshold is optimal.”””* The mass accuracy, 6m can be parametrised 
as: 


1+0.36/VN 
ôm ~ AE — 2.9 
V18NLo (2) 
for S-wave processes, where the cross section rises as 8 and as 
1 1 .38/ VN 
ôm ~ 7 7 eee OAN (2.10) 


N1/4 26NLo 


for P-wave processes, where the cross section rises as 6°. The combina- 
tion of these measurements allows the ILC to determine the x} mass to 
+0.05 GeV, which is two orders of magnitude better than the anticipated 
LHC accuracy, while the mass difference between the 7, and the x} can be 
measured to +0.3 GeV, which is more than a factor ten better. Extension 
of ILC operation to 1 TeV gives access to the e6e~ — H? A? process. As a 
result of the precision of these measurements, the ILC data at 0.5 TeV will 
allow to predict the neutralino relic density to +2 % and the addition of 
1.0 TeV data will improve it to +0.25 %. It is suggestive that this accuracy 
is comparable, or better, than that expected by the improved CMB survey 
by the PLANCK mission. For comparison, the LHC data should provide a 
+7 % accuracy. This already a remarkable result, due the fact that, a large 
number of measurements will be available at the LHC and SUSY decay 
chains can be reconstructed. Still, the overall mass scale remains uncertain 
at the LHC. The direct mass measurements on the ILC data remove this 
uncertainty. 

The LCC1 point is characterised by the relatively low SUSY mass scale, 
most of the particles can be observed at the LHC and their masses accu- 
rately measured at the ILC. However, in more general scenarios, the infor- 
mation available from both collider will be more limited. This is the case at 
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Fig. 2.12. DM-motivated SUSY 7 reconstruction at ILC: determination of the stau- 
neutralino mass difference from a reconstruction of ete~ — 7,71 at 0.5 TeV for LCC3 
(from Ref. 69). 


the LCC2 point, located in the focus point region, where masses of scalar 
quarks, sleptons and heavy Higgs bosons are very large, typically beyond 
the ILC but also the LHC reach, while gauginos masses are of the order 
of few hundreds GeV, thus within the kinematical domain of the ILC. In 
this specific scenario, the LHC will observe the SUSY process g — qqy and 
the subsequent neutralino and chargino decays. Still the neutralino relic 
density can only be constrained within +40% and the hypothesis 2,.h? = 0, 
namely that the neutralino does not contribute to the observed dark mat- 
ter density in the universe, cannot be ruled out, based only on LHC data. 
At a 0.5 TeV collider, the main SUSY reactions are ete~ — xfx] and 
ete” — y$x§. Operation at 1 TeV gives access also to ete~ — yf x3 and 


ete — y3x{. Not only the gaugino mass splittings but also the polarised 
neutralino and chargino production cross section can be accurately deter- 
mined at the ILC.®° These measurements fix the gaugino-Higgsino mixing 
angles, which play a major role in determining the neutralino relic density. 
The decoupling of the heavier, inaccessible part of the SUSY spectrum, 
can be insured with the data at the highest energy. The combined ILC 
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Fig. 2.13. DM-motivated SUSY Higgs reconstruction at ILC: (a) an event ete~ — 
A? H? — bbbb at 1 TeV in the LDC detector and (b) di-jet invariant mass spectrum for 
ete— — A? H? — bbbb at 1 TeV for LCC4 (from Ref. 70). 


data at 0.5 TeV and 1 TeV provide an estimate of the neutralino relic den- 
sity to +8 % accuracy, which matches the current WMAP precision. The 
characteristics featured by the LCC2 point persist, while the SUSY masses 
increase, provided the gaugino-Higgsino mixing angle remains large enough. 
This DM-motivated region extends to SUSY masses which eventually ex- 
ceed the LHC reach, highlighting an intriguing region of parameters where 
the ILC can still observe sizable production of supersymmetric particle, 
compatible with dark matter data, while the LHC may report no signals of 
New Physics.8? 

Instead, the last two points considered, LCC3 and LCC4, are represen- 
tative of those regions where the neutralino relic density is determined by 
accidental relationships between particle masses. Other such regions may 
also be motivated by baryogenesis constraints. The determination of the 
neutralino relic density, in such scenarios, depends crucially on the preci- 
sion of spectroscopic measurements, due to the large sensitivity on masses 
and couplings. The conclusions of the current studies are that the LHC 
data do not provide quantitative constraints. On the contrary, the ILC can 
obtain interesting precision, especially when high energy data is available. 

The LCC3 point is in the so-called 7 co-annihilation region. Here, the 
mass difference between the lightest neutralino, x9, and the lightest scalar 
tau, 71, is small enough that 7? — Ty can effectively remove neutralinos 
in the early universe. The relative density of 7 particles to neutralinos scales 


mg —mix 
ase  ™x ,so this scenario tightly constrain the mz—m, mass difference. 
Here, the precise mass determinations characteristic of LCC1 will not be 
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Fig. 2.14. Relic DM density determination based on simulation from LHC, ILC at 
0.5 TeV and ILC at 1.0 TeV for the four SUSY benchmark points studied: a) LCC1, b) 
LCC2, c) LCC3 and d) LCC4. The plots show the probability density functions of the 
Qh? values corresponding to MSSM points compatible with the accelerator data (from 
Ref. 74). 


available: at 0.5 TeV, the ILC will observe a single final state, TtT + 
Enissing, from the two accessible SUSY processes,°? ete~ — 7171, 7 > TX? 
and ete~ > x?x8, x9 = x?7 — x?xor7. The signal topology consists of 
two T-jets and missing energy. Background processes, such as ete~ — ZZ 
can be suppressed using cuts on event shape variables. The mass splitting 
can be determined by a study of the distribution of the invariant mass of the 
system made by the two 7-jets and the missing energy vector, Mj, jo Emissing' 
In this variable, the remaining SM background is confined to low values and 
the shape and upper endpoint of the 7,7, contribution depends on the stau- 
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neutralino mass difference, AM = Mz, — Myo. Templates functions can be 
generated for different values of AM and the mass difference is extracted 
by a x? fit of these templates to the “data”. As the AM value decreases, 
the energy available to the 7 leptons decreases. Since r decays involve 
neutrinos, additional energy is lost from detection. When the rr system 
becomes soft, the four fermion background process ee — ee7T, the so-called 
yy background which has cross sections at the nb level, makes its detection 
increasingly difficult. What makes possible to reject these yy events is 
the presence of the two energetic primary electrons at small angle w.r.t. 
the beamline.” This is a significant challenge for low angle calorimetry, 
since the electron has to be detected in an hostile environment populated 
by a large number of other electrons, of lower energy, arising from pairs 
created during the bunch collision.8°°! A detailed study,” performed for 
a statistics of 500 fb—!, shows that values of AM as small as 5 GeV can be 
measured at the ILC, provided the primary electrons can be vetoed down to 
17 mrad. In the specific case of the LCC3 point, where the mass splitting, 
AM, is 10.8 GeV, an accuracy of 1 GeV can be achieved. Heavier gauginos, 
as well as the A’ boson, become accessible operating the ILC at 1 TeV. 
These data constrain both the mixing angles and tan 8. As a result the 
neutralino relic density can be estimated with an 18 % accuracy. Finally, 
the LCC4 point, chosen in the A funnel, has the DM density controlled 
by the yy — A process. This point is rather instructive in terms of the 
discovery-driven evolution of a possible experimental program at the ILC. 
The ILC can obtain the neutralino and 7 masses at 0.5 TeV, following 
the same technique as for LCC3. We would also expect LHC experiments 
to have observed the A? boson, but it is unlikely M4 could be determined 
accurately in pp collisions, since the available observation mode is the decay 
in 7 lepton pairs. At this stage, it would be apparent that the mass relation 
between the neutralino mass, accurately measured by the ILC at 0.5 TeV, 
and the A boson mass, from the LHC data, is compatible with M4 ~ 2M, 
as required for the s-channel annihilation process to be effective. Three 
more measurements have to be performed at the ILC: the A? mass, M4, 
and width T4 and the u parameter, which is accessible through the mass 
splitting between heavier neutralinos, x8, x9 and the lighter x?, x8. All 
these measurements are available by operating the ILC at 1 TeV. M4 and 
T4 can be determined by studying the A? production in association with 
a H? boson, in the reaction e+e~ —> A} H? — bbbb. This process results in 
spectacular events with four b jets, emitted almost symmetrically, due to 
low energy carried by the heavy Higgs bosons (see Figure 2.13a). The cross 
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section, for the parameters of LCC4 corresponding to Ma = 419 GeV, is 
just 0.9 fb highlighting the need of large luminosity at the highest energy. 
Jet flavour tagging and event shape analysis significantly reduces the major 
multi-jet backgrounds, such as WW, ZZ and tt. The SM bbbb electro-weak 
background has a cross section of ~3 fb, but since it includes Z° or h? as 
intermediate states it can be efficiently removed by event shape and mass 
cuts. After event selection, the A? mass and width must be reconstructed 
from the measured di-jet invariant masses. This is achieved by pairing jets 
in the way that minimises the resulting di-jet mass difference, since the 
masses of the A and H bosons are expected to be degenerate within a few 
GeV, and the di-jet masses are computed by imposing constraints on energy 
and momentum conservation to improve the achievable resolution and gain 
sensitivity to the boson natural width (see Figure 2.13b). The result is a 
determination of the A mass to 0.2 % and of its width to ~15 % if a sample 
of 2 ab~! of data can be collected. The full set of ILC data provides a 
neutralino relic density evaluation with 19 % relative accuracy. The full 
details of how these numbers were obtained can be found in Ref.” . 
SUSY offers a compelling example for investigating the complementar- 
ity in the search and discovery of new particles and in the study of their 
properties at the LHC and ILC. The connection to cosmology, through the 
study of dark matter brings precise requirements in terms of accuracy and 
completeness of the anticipated measurements and puts emphasis on sce- 
narios at the edges of the parameter phase space. The interplay of satellite, 
ground-based and collider experiments in cosmology and particle physics 
will be unique and it will lead us to learn more about the structure of our 
Galaxy and of the Universe as well as of the underlying fundamental laws 
of the elementary particles. This quest will represent an major effort for 
science in the next several decades. The scenarios discussed above high- 
light the essential role of the ILC in this context. It will testing whether 
the particles observed at accelerators are responsible for making up a size- 
able fraction of the mass of the Universe, through precision spectroscopic 
measurements. The data obtained at the ILC will effectively remove most 
particle physics uncertainties and become a solid ground for studying dark 
matter in our galaxy through direct and indirect detection experiments.*4 


2.4.3. Indirect Sensitivity to New Physics at the ILC 


Beyond Supersymmetry there is a wide range of physics scenarios invoking 
new phenomena at, and beyond, the TeV scale. These may explain the 
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origin of electro-weak symmetry breaking, if there is no light elementary 
Higgs boson, stabilise the SM, if SUSY is not realised in nature, or embed 
the SM in a theory of grand unification. The ILC, operating at high en- 
ergy, represents an ideal laboratory for studying this New Physics in ways 
that are complementary to the LHC.®°°° Not only it may directly produce 
some of the new particles predicted by these theories, the ILC also retains 
an indirect sensitivity, through precision measurements of virtual correc- 
tions to electro-weak observables, when the new particle masses exceed the 
available centre-of-mass energy. 

One of the simplest of such SM extensions consists of the introduction 
of an additional U(1) gauge symmetry, as predicted in some grand unified 
theories.87°* The extra Z’ boson, associated to the symmetry, naturally 
mixes with the SM Z°. The mixing angle is already strongly constrained, 
by precision electroweak data, and can be of the order of few mrad at most, 
while direct searches at Tevatron for a new Z’ boson set a lower limit on its 
mass around 800 GeV, which may reach 1 TeV by the time the LHC will 
start searching for such a state. The search for an extended gauge sector 
offers an interesting framework for studying the ILC sensitivity to scales 
beyond those directly accessible. It also raises the issue of the discrimination 
between different models, once a signal would be detected. The main classes 
of models with additional Z’ bosons include Eg inspired models and left- 
right models (LR). In the Eg models, the Z’ fermion couplings depend on 
the angle, 0g, defining the embedding of the extra U(1) in the Eg group. 
At the ILC, the indirect sensitivity to the mass of the new boson, Mz, 
can be parametrised in terms of the available integrated luminosity, £, and 
centre-of-mass energy, ys. A scaling law for large values of Mz can be 
obtained by considering the effect of the Z’ — y interference in the two 
fermion production cross section o(ete~ — ff) (o pF in the following). For 
s << Mż, and assuming the uncertainties dc to be statistically dominated, 
we obtain the following scaling for the difference between the SM cross 
section and that in presence of the Z’, in units of the statistical accuracy: 

SM SM+Z'| i 

aes me, Ve (2.11) 

from which we can derive that the indirect sensitivity to the Z’ mass scales 
with the square of the centre-of-mass energy and the luminosity as: 

Mz x (s£). (2.12) 


In a full analysis, the observables sensitive to new physics contribution in 
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two-fermion production are the cross section of7, the forward-backward 


asymmetries At. and the left-right asymmetries Ave. The ILC gives 
us the possibility to study a large number of reactions, epe;, eker `> 
(uti + dd), s5, cé, bb, tt, ete~, utu, rtr with final states of well 
defined flavour and, in several cases, helicity. In order to achieve this, jet 
flavour tagging is essential to separate b quarks from lighter quarks and c 
quarks from both 6 and light quarks. Jet-charge and vertex-charge recon- 
struction allows then to tell the quark from the antiquark produced in the 
same event.8™ Similarly to LEP and SLC analyses, the forward-backward 
asymmetry can be obtained from a fit to the flow of the jet charge QI, 
defined as QJ = eee where q; is the particle charge, p; its mo- 
mentum, T the jet thrust axis and the sum is extended to all the particles 
in a given jet. Another possible technique uses the charge of secondary 
particles to determine the vertex charge and thus the quark charge. The 
application of this technique to the ILC has been studied in some details 
in relation to the optimisation of the Vertex Tracker.®! At ILC energies, 
the ete~ — ff cross sections are significantly reduced, compared to those 
at LEP and SLC: at 1 TeV the cross section o(e+e~ — bb) is only 100 fb, 
so high luminosity is essential and new experimental issues emerge. At 
1 TeV, the ILC beamstrahlung parameter doubles compared to 0.5 TeV, 
beam-beam effects becoming important, and the primary e*e7 collision is 
accompanied by yy — hadrons interactions.°? Being mostly confined in the 
forward regions, this background may reduce the polar angle acceptance for 
quark flavour tagging and dilute the jet charge separation using jet charge 
techniques. The statistical accuracy for the determination of 0 sF, Al and 
Ae has been studied, for u* u7 and bb, taking the ILC parameters at \/s = 
1 TeV. The additional particles from the yy background cause a broadening 
of the QJ distribution and thus a dilution of the quark charge separation. 
Detailed full simulation and reconstruction is needed to fully understand 
these effects. Despite these backgrounds, the anticipated experimental ac- 
curacy in the determination of the electro-weak observables in two-fermion 
processes at 1 TeV is of the order of a few percent, confirming the ILC role 
as the precision machine. Several scenarios of new physics have been inves- 
tigated.°%.°4 The analysis of the cross section and asymmetries at 1 TeV 
would reveal the existence of an additional Z’ boson up to œ 6-15 TeV, 
depending on its couplings. As a comparison the LHC direct sensitivity 
extends up to approximately 4-5 TeV. The ILC indirect sensitivity also 
extends to different models on new physics, such as 5-dimensional exten- 
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Fig. 2.15. Indirect sensitivity to Z’ bosons at ILC: (a) mass sensitivity to different Z’ 
models for 1 ab~+ of data at different centre-of-mass energies compared to that of LHC 
and (b) accuracy on leptonic couplings for a 5 TeV Z’ boson (from Ref. 23) 


sion of the SM with fermions on the boundary for a compactification where 
scales up to about 30 TeV can be explored. Finally, fermion compositeness 
or the exchange of very heavy new particles can be described in terms of 
effective four-fermion contact interactions.°° The interaction depends on a 
scale A = Mx/g, where Mx is the mass of the new particle and g the cou- 
pling. Limits to this scale A can be set up to ~ 100 TeV, which shows that 
the ILC sensitivity to new phenomena can exceed its centre-of-mass energy 
by a significant factor. In order to maximise this indirect sensitivity to new 
physics, the precision of the SM predictions should match the experimental 
accuracy. Now, at TeV energies, well above the electroweak scale,the ILC 
will face the effects of large non-perturbative corrections. Large logarithms 
x a” log?"(M?/s) arise from the exchange of collinear, soft gauge bosons 
and are known as Sudakov logarithms.®° At 1 TeV the logarithmically en- 
hanced W corrections to opg, of the form a log?(M?,/s) and a log( M8, /s) 
amount to 19% and -4% respectively. The effect of these large logarithmic 
corrections has been studied in some details.'%°” It will be essential to 
promote a program of studies to reduce these theoretical uncertainties, to 
fully exploit the ILC potential in these studies. 


The International Linear Collider 85 
2.4.4. Run Plan Scenario 


One of the points of strength of the ILC is in its remarkable flexibility of 
running conditions. Not only the centre-of-mass energy can be changed 
over approximately an order of magnitude, but the beam particle and their 
polarization state can be varied to suit the need of the physics processes 
under study. At the same time, the ILC program is most diversified and 
data taken at the same centre-of-mass energy may be used for very differ- 
ent analyses, such as precise top mass determination, Higgs boson studies 
and reconstruction of SUSY decays. This has raised concerns whether the 
claimed ILC accuracies can be all achieved with a finite amount of data. A 
dedicated study was performed in 2001, under the guidance of Paul Gran- 
nis, taking two physics scenarios with Supersymmetry realised at relatively 
low mass, one being the LCC1 benchmark point, rich in pair-produced 
particles and requiring detailed threshold scans.®® The study assumes a 
realistic profile for the delivered luminosity, which increases from 10 fb~! 
in the first year to 200 fb~! in the fifth year and 250 fb~! afterward, for 
a total integrated equivalent luminosity f £ = 1 ab~!. The proposed run 
plan starts at the assumed maximum energy of 0.5 TeV for a first determi- 
nation of the sparticle masses through the end-point study and then scans 
the relevant thresholds, including tt in short runs with tuned polarization 


states. A summary is given in Table 2.6. This plan devotes approximately 


Table 2.6. ILC Run plan scenario for LCC1. 


Beams ys Pol. f£ Comments 
(TeV) (fb—1) 


Te~ 0.500 L/R 335 Sit at max. energy for sparticle endpoint measurements 
e7 0.270 L/R 100 Scan x?x$ (R pol.) and 7171 (L pol.) 
Te 0.285 R 50 Scan PRR 
e7 0.350 L/R 40 Scan tł, črēz (L& R pol.), x} x] (L pol.) 
e7 0.410 L/R 100 Scan 772 
e 0.285 RR 10 Scan for čr mass 


two third of the total luminosity at, or near, the maximum energy, so the 
program will be sensitive to unexpected new phenomena at high energy, 
while providing accurate measurements of masses through dedicated scans. 
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2.5. Sensors and Detectors for the ILC 


The development of the ILC accelerator components and the definition of 
its physics case has been paralleled by a continuing effort in detector de- 
sign and sensor R&D. This effort is motivated by the need to design and 
construct detectors which match the ILC promise to provide extremely ac- 
curate measurements over a broad range of collision energies and event 
topologies. It is important to stress that, despite more than a decade of 
detector R&D for the LHC experiments, much still needs to be done to 
obtain sensors matching the ILC requirements. While the focus of the 
LHC-motivated R&D has been on sensor radiation hardness and high trig- 
ger rate, the ILC, with its more benign background conditions and lower 
interaction cross sections, admits sensors of new technology which, in turn, 
have better granularity, smaller thickness and much improved resolution. 
Sensor R&D and detector design are being carried out world-wide and are 
starting deploying prototype detector modules on test beamlines. 


2.5.1. Detector Concepts 


The conceptual design effort for an optimal detector for the ILC interaction 
region has probed a wide spectrum of options which span from a spherical 
detector structure to improved versions of more orthodox barrel-shaped de- 
tectors. These studies have been influenced by the experience with SLD at 
the SLC, ALEPH, DELPHI and OPAL at LEP, but also with ATLAS and 
CMS at the LHC. The emphasis on accurate reconstruction of the particle 
flow in hadronic events and thus of the energy of partons is common to all 
designs. The main tracker technology drives the detector designs presently 
being studied. Four detector concepts have emerged, named GLD, LDC, 
SiD and 4t Concept.®® A large volume, 3D continuous tracking volume 
in a Time Projection Chamber is the centerpiece of the GLD, the LDC 
and the so-called 4" Concept designs. The TPC is followed by an highly 
segmented electro-magnetic calorimeter for which these three concepts are 
contemplating different technologies A discrete tracker made of layers of 
high precision Silicon microstrip detectors, and a larger solenoidal field, 
which allows to reduce the radius, and thus the size, of the calorimeter 
is being studied in the context of the SiD design. Dedicated detector de- 
sign studies are being carried out internationally!©°!°! to optimise, through 
physics benchmarks,!°? the integrated detector concepts. Such design ac- 
tivities provide a bridge from physics studies to the assessment of priorities 
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Fig. 2.16. View of the four ILC detector concepts presently being studied: GLD (upper 
left), LDC (upper right), SiD (lower left) and 4¢” Concept (lower right). 


in detector R&D and are evolving towards the completion of engineered 
design reports at the end of this decade, synchronously with that foreseen 
for the ILC accelerator. 


2.5.2. Vertexing and Tracking 


The vertex and main tracker detectors must provide jet flavour identifica- 
tion and track momentum determination with the accuracy which makes 
the ILC such a unique facility for particle physics. The resolution in extrap- 
olating charged particle trajectories to their production point, the so-called 
impact parameter, is dictated by the need to distinguish Higgs boson decays 
to cé from those to bb pairs, but also r+7~ and gluon pairs, as discussed 
in section 2.4.1. In addition, vertex charge measurements put emphasis on 
precise extrapolation of particle tracks down to very low momenta. Tag- 
ging of events with multiple b jets, such as ete~ — H? A? — bbbb, discussed 
in section 2.4.2, underscores the need of high tagging efficiency, €b, since 
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the overall efficiency scales as GN , where N is the number of jets to be 
tagged. This is best achieved by analysing the secondary vertex structures 
in hadronic jets. A B meson, from a Higgs boson produced at 0.5 TeV, has 
an average energy of £gy/s/4 ~ 100 GeV, where xg ~ 0.7 represents the 
average b fragmentation function, or a y value of ~ 70. Since cr ~ 500 um, 
the average decay distance @ycr is 3.5 mm and the average impact parame- 
ter, Gycr sin @, is 0.5 mm. In comparison, a D meson from a H — cc decay 
has a decay length of 1.3 mm. More importantly, the average charged de- 
cay multiplicity for a B meson is 5.1, while for a D meson is 2.7. Turning 
these numbers into performance requirements sets the target accuracy for 
the asymptotic term a and the multiple scattering term b defining the track 
extrapolation resolution in the formula 


Oextrapolation = & p= (2.13) 
Pt 


The ILC target values are compared to those achieved by the DELPHI 
experiment at LEP, those expected for ATLAS at the LHC and the 
best performance ever achieved at a collider experiment, that of SLD, in 
Table 2.7. This comparison shows that the improvements required for ILC 


Table 2.7. Values for the asymptotic 
term a and multiple scattering term b 
defining the track extrapolation reso- 
lution required for the ILC compared 
to those obtained by other collider 
experiments. 


Experiment a (um) b (um/GeV) 


ILC 5 10 
DELPHI 28 65 
ATLAS 15 75 
SLD 8 33 


on state-of-the-art technology is a factor 2-5 on asymptotic resolution and 
another factor 3-7 on the multiple scattering term. 

At the ILC, particle tracks in highly collimated jets contribute a local 
track density on the innermost layer of 0.2-1.0 hits mm~? at 0.5 TeV, to 
reach 0.4-1.5 hits mm~? at 1.0 TeV. Machine-induced backgrounds, mostly 
pairs, add about 3-4 hits mm~?, assuming that the detector integrates 80 
consecutive bunch crossings in a train. These values are comparable to, 
or even exceed, those expected on the innermost layer of the LHC detec- 
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tors: 0.03 hits mm~? for proton collisions in ATLAS and 0.9 hits mm? for 
heavy ion collisions in ALICE. Occupancy and point resolution set the pixel 
size to 20x20 um? or less. The impact parameter accuracy sets the layer 
material budget to < 0.15% Xo/layer. This motivates the development of 
thin monolithic pixel sensors. Charge coupled devices (CCD) have been 
a prototype architecture after the success of the SLD VXD3.!9% However, 
to match the ILC requirements in terms of radiation hardness and read- 
out speed significant R&D is needed. New technologies, such as CMOS 
active pixels,!°4 SOI and DEPFET! sensors, are emerging as promis- 
ing, competitive alternatives, supported by an intensive sensor R&D effort 
promoted for the ILC.1°7 

The process ete — H°Z°, H® — X, Z? — 4+4 gives access to 
Higgs production, irrespective of the Higgs decay properties. Lepton mo- 
menta must be measured very accurately for the recoil mass resolution to 
be limited by the irreducible smearing due to beamstrahlung. Since the 
centre-of-mass energy ys = Ep + Ez is known and the total momentum 
pH +pz =0, the Higgs mass, My can be written as: 


Mh = Eq —piy = (vVs-Ez) -pz = s+ EZ—2y/sEz—py = 8—2/sEz+Mz 
(2.14) 


In the decay Z? —> pty, Ez = E,+ + E,- so that the resolution on 
My depends on that on the muon momentum. In quantitative terms the 


resolution required is 
ép/p? < 2 x 1075 (2.15) 


A comparison with the performance of trackers at LEP and LHC is given in 
Table 2.8. The ability to tag Higgs bosons, independent on their decay mode 


Table 2.8. Values for the momentum res- 
olution 5p/p for the main tracker and 
the full tracking system at ILC, LEP and 
LHC. These values do not include the vertex 
constraint. 


Experiment Main Tracker Full Tracker 


Only 
ILC 1.5 x 1074 5x 1075 
ALEPH 1.2 x 1073 5x 107-4 


ATLAS = 2x 1074 


is central to the ILC program in Higgs physics. A degraded momentum 


resolution would correspond to larger background, mostly from ete~ — 
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ZZ*, being accepted in the Higgs signal sample. This degrades the accuracy 
on the determination of the Higgs couplings both in terms of statistical and 
systematic uncertainties. The particle momentum is measured through its 
bending radius R in the solenoidal magnetic field, B. The error on the 
curvature, k = 1/R, for a particle track of high momentum, measured at 
N equidistant points with an accuracy, o, over a length L, applying the 
constraint that it does originate at the primary vertex (as for the leptons 
from the Z° in the Higgstrahlung reaction) is given by:1°8 
o 320 
NN A (2.16) 

This shows that the same momentum resolution can be achieved either by a 
large number of measurements, each of moderate accuracy, as in the case of 
a continuous gaseous tracker, or by a small number of points measured with 
high accuracy, as in the case of a discrete Si tracker. Continuous tracking 
capability over a large area, with timing information and specific ionization 
measurement, and its robust performance make the Time Projection Cham- 
ber an attractive option for precision tracking at the ILC. The introduction 
of Micro Pattern Gaseous Detectors!?®110 (MPGD) offers significant im- 
provements in terms of reduced F x B, larger gains, ion suppression and 
faster, narrower signals providing better space resolution. Improving on 
the space resolution requires an optimal sampling of the collected charge, 
while the high solenoidal magnetic field reduces the diffusion effects. Sev- 
eral paths are presently being explored with small size prototypes operated 
on beamlines and in large magnetic fields.41}:1!? 

A multi-layered Si strip detector tracker in an high B field may offer 
a competitive dp/p? resolution with reduced material budget and afford 
a smaller radius ECAL, thus reducing the overall detector cost. This is 
the main rationale promoting the development of an all-Si concept for the 
main tracker, which follows the spirit of the design of the CMS detector at 
LHC. Dedicated conceptual design and module R&D is being carried out 
as a world-wide program.'!? There is also considerable R&D required for 
the engineering of detector ladders, addressing such issues as mechanical 
stability and integration of cooling and electrical services. These modules 
may also be considered as supplemental tracking devices in a TPC-based 
design to provide extra space points, with high resolution, and in end- 
cap tracking planes. Assessing the required detector performance involves 
realistic simulation and reconstruction code accounting for inefficiencies, 
noise, overlaps and backgrounds. 
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2.5.3. Calorimetry 


The ILC physics program requires precise measurements of multi-jet 
hadronic events, in particular di-jet invariant masses to identify W, Z 
and Higgs bosons, through their hadronic decays. An especially demand- 
ing reaction is ete7 — Z°H°H°, which provides access to the triple 
Higgs coupling as discussed in section 2.4.1. The large background from 
ete — Z°Z°Z° can be reduced only by an efficient H°/Z° separation, 
based on their masses. This impacts the parton energy resolution through 
the measurement of hadronic jets. Detailed simulation®! shows that a jet 
= = ~ os is required, in order to achieve an interest- 
ing resolution on the gyp coupling. The analysis of other processes, such 
as ete” — WtW~vi and Higgs hadronic decays, leads to similar con- 
clusions.!!5 In the case of the determination of H? — WtW- branching 
fractions, the statistical accuracy degrades by 22 % when changing the jet 


energy resolution from 232 to Weck Such performance is unprecedented 


VE 


energy resolution 
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Fig. 2.17. W= and Z? gauge boson pair production separation at the ILC: invariant 
mass of the first di-jet pair vs. that of the second for a sample of WW and ZZ for two 
different assumptions on the jet energy resolution (a) = and (b) a- (from Ref. 114). 


and requires the development of an advanced calorimeter design as well 
as new reconstruction strategies. The most promising approach is based 
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on the particle flow algorithm (PFA). The energy of each particle in an 
hadronic jet is determined based on the information of the detector which 
can measure it to the best accuracy. In the case of charged particles, this 
is achieved by measuring the particle bending in the solenoidal field with 
the main tracker. Electromagnetic neutrals (y and 7°) are measured in the 
electromagnetic calorimeter and hadronic neutrals (KY, n) in the hadronic 
calorimeter. The jet energy is then obtained by summing these energies: 


Eiet z Echarged + Eem neutral + Ehad neutral (2.17) 


each being measured in a specialised detector. The resolution is given by: 
2 2 2 2 2 

OB ict = Oo charged F Fem neutral F Ohad neutral E Ocon fusion: (2.18) 

Assuming the anticipated momentum resolution, og œ 0.11/VE for the 

e.m. calorimeter, og ~ 0.40/VE for the hadronic calorimeter and the frac- 

tions of charged, e.m. neutral and hadronic neutral energy in an hadronic 


jet we get: 
1 ES 
2 ay 2 charged 
Oo charged & (0.02GeV) To y ToGevyt (2.19) 
E; 
2 2 jet 
~ (0.6GeV )* ——— 2.2 

Fem neutral ( e ) 100GeV ( 0) 

Eje 
Ohad neutral ~ (13GeV)’ aay (2.21) 


In case of perfect energy-particle association this would correspond to a 
jet resolution ~ 0.14/VE. But a major source of resolution loss turns 
out to be the confusion term, Ocon fusion, Which originates from inefficien- 
cies, double-counting and fakes, which need to be minimised by an efficient 
pattern recognition. This strategy was pioneered by the ALEPH experi- 
ment at LEP, where a resolution ~ 0.60/VE was obtained, starting from 
the stochastic resolutions of og ~ 0.18/ VE for the e.m. calorimeter, and 
op ~ 0.85/VE for the hadronic calorimeter.!!® At hadron colliders, the 
possible improvement from using tracking information together with calori- 
metric measurements is limited, due to underlying events and the shower 
core size. On the contrary, at the ILC these limitations can be overcome, 
by developing an imaging calorimeter, where spatial resolution becomes as 
important as energy resolution. The minimisation of the confusion rate can 
then be obtained by choosing a large solenoidal field, B, and calorimeter 
radius, R, to increase the separation between charged and neutral particles 


The International Linear Collider 93 


Fig. 2.18. Visualisation of the imaging calorimeter for the ILC: simulated response of 
a SiW calorimeter to a jet from ete~ — WWT — jets at /s=0.8 TeV. 


in dense jets, a small Moliere radius, Rm, for the e.m. calorimeter, to reduce 
the transverse shower spread and small cells, Rpize, with large longitudinal 
segmentation. The distance between a neutral and a charged particle, of 
transverse momentum p;, at the entrance of the e.m. calorimeter located 
at a radius R is given by 0.15BR?/p;, where B is the solenoidal magnetic 
field. A useful figure of merit of the detector in terms of the particle flow 
reconstruction capability is then offered by: 

BR? 

R? R? 


pixel 


(2.22) 


which is a measure of the particle separation capability. The value of BR? 
is limited to about 60 Tm? by the mechanical stability. An optimal material 
in terms of Moliere radius is Tungsten, with Rm = 9 mm. In four-jet events 
at y/s=0.8 TeV, there are on average 28 GeV per di-jet carried by photons, 
which are deposited within 2.5 cm from a charged particle at the e.m. 
calorimeter radius. With pixel cells of order of 1 x 1 cm? to ensure sufficient 
transverse segmentation and 30 to 40 layers in depth, the e.m. calorimeter 
would consists of up to 30 M channels and 3000 m? of active Si. Due to the 
large amount of channels and the wish to use an absorber with the smallest 
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possible Moliere radius, the e.m. calorimeter is the main cost-driver of the 
ILC detector and its optimisation in terms of performance and cost requires 
a significant R&D effort. A Silicon-Tungsten calorimeter (SiW) was first 
proposed in the framework of the TESLA study!" and it is currently 
being pursued by large R&D collaborations in both Europe and the US. 
Alternative technologies are also being studied by the GLD and the 4t” 
Concept. This R&D program involves design, prototyping and tests with 
high energy particle beams and it is being carried out world-wide,!!® 1° 
supported by efforts on detailed simulation and reconstruction. 


2.6. Epilogue 


The ILC promises to complement and expand the probe into the TeV scale 
beyond the LHC capabilities, matching and improving its energy reach 
while adding precision. Its physics program will address many of the fun- 
damental questions of today’s physics from the origin of mass, to the nature 
of Dark Matter. After more than two decades of intense R&D carried out 
world-wide, the ete~ linear collider, with centre-of-mass energies up to 
1 TeV, has become technically feasible and a costed reference design is now 
available. Detectors matching the precision requirements of its anticipated 
physics program are being developed in an intense R&D effort carried out 
world-wide. Now, theoretical predictions matching the anticipated exper- 
imental accuracies are crucially needed, as well as further clues on what 
physics scenarios could be unveiled by signals that the LHC may soon be 
observing. These will contribute to further define the physics landscape for 
the ILC. A TeV-scale electron-positron linear collider is an essential com- 
ponent of the research program that will provide in the next decades new 
insights into the structure of space, time, matter and energy. Thanks to the 
efforts of many groups from laboratories and universities around the world, 
the technology for achieving this goal is now in hand, and the prospects for 
the ILC success are extraordinarily bright. 
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Chapter 3 


Astrophysical Aspects of Neutrinos 


John F. Beacom 
Center for Cosmology and Astro-Particle Physics, 
Departments of Physics and Astronomy, 
191 W. Woodruff Ave., Columbus, OH 48210, USA 


Neutrino astronomy is on the verge of discovering new sources, and 
this will lead to important advances in astrophysics, cosmology, par- 
ticle physics, and nuclear physics. This paper is meant for non-experts, 
so that they might understand the basic issues in this field. 


3.1. General Introduction 


It has long been appreciated that neutrino astronomy would have unique 
advantages. The principal one, due to the weak interactions of neutrinos, is 
that they would be able to penetrate even great column densities of matter. 
This could be in dense sources themselves, like stars, supernovae, or active 
galactic nuclei. It could also be across the universe itself. Of course, the 
small interaction cross section is also the curse of neutrino astronomy, and 
to date, only two extraterrestrial sources have been observed: the Sun, and 
Supernova 1987A. That’s it. 

However, a new generation of detectors is coming online, and their capa- 
bilities are significantly better than anything built before. Additionally, a 
great deal of theoretical effort, taking advantage of the very rapid increases 
in the quality and quantity of astrophysical data, has refined estimates of 
predicted fluxes. The basic message is that the detector capabilities appear 
to have nearly met the theoretical predictions, and that the next decade 
should see several exciting first discoveries. 

For these two talks, I was asked to introduce the topics of supernova 
neutrinos and high-energy neutrinos. See the other talks in this volume for 
more about these and related topics. To increase the probability of this 
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paper being read, I have condensed the material covered in my computer 
presentation, focusing on the basic framework instead of the details. In 
the actual lectures, I made extensive use of the blackboard, and of inter- 
action with the students through questions from them (and to them). It 
isn’t possible to represent that here. I thank the students for their active 
participation, and hope that they’ve all solved the suggested problems! 


3.2. PART ONE: Supernova Neutrinos 


3.2.1. Preamble 


Over the centuries, supernovae, which appear as bright stars and then dis- 
appear within a few months, have amazed and confused us. We’re still 
amazed, and as Fermi said, we’re still confused, just on a higher level. The 
historical observations of supernovae were of rare objects in our own Milky 
Way Galaxy (here and elsewhere, “Galaxy” is used for the Milky Way, and 
“galaxy” for the generic case). Now that we know their distances, we know 
that supernovae are extremely luminous in the optical, in fact comparable 
to the starlight from the whole host galaxy. But that’s not the half of it, 
literally. If you had neutrino-detecting eyes, you’d see the neutrino burst 
from a single core-collapse supernova outshine the steady-state neutrino 
emission from all the stars in a galaxy (the analog of solar neutrinos) by a 
factor more like 10° (that’s a lot!). This is what enabled the detection of 
about 20 neutrinos from Supernova (SN) 1987A, despite its great distance. 

A good general rule in decoding physical processes is “Follow the en- 
ergy,” much like “Follow the money” for understanding certain human en- 
deavors. For core-collapse supernovae, this means the neutrinos, while for 
thermonuclear supernovae, this means the gamma rays. These are the 
direct messengers that reveal the details of the explosions. In the follow- 
ing, Pll discuss this in more detail, mostly focusing on the “observational” 
perspective, since it’s easy to be convinced that observing these direct mes- 
sengers is important, while hard to think of how to actually do it. As I will 
emphasize, this is much more than just astronomy for its own sake: these 
data play a crucial role in testing the properties of neutrinos, and more 
generally, in probing light degrees of freedom beyond the Standard Model. 


3.2.2. Introduction 


Stars form from the collapse and fragmentation of gas clouds, and empiri- 
cally, the stellar Initial Mass Function is something like dn/dm ~ m~?:°, 
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where m = Metar/Msun, as first pointed out by Salpeter in 1955, and re- 
fined by many authors since. You'll notice that this distribution is not 
renormalizable, but don’t start worrying about dimensional regularization 
— a simple cutoff near m = 0.1 is enough for our purposes. What is the fate 
of these stars? There are two interesting broad categories. The “types” are 
observational distinctions, based on spectral lines, but the divisions below 
are based on the physical mechanisms. 


e Thermonuclear (Type Ia) supernovae 
These have progenitors with m ~ 3-8, and live for ~ Gyr. The 
interesting case is when the progenitor has ended its nuclear fusion 
processes at the stage of being a carbon/oxygen white dwarf, while 
it has a binary companion that donates mass through accretion. 
Once the mass of the progenitor grows above the Chandrasekhar 
mass of m = 1.4, this carbon and oxygen will explosively burn all 
the way up to elements near iron, generating a tremendous amount 
of energy. The most important isotope produced is °°Ni, which de- 
cays to °°Co with t = 9 days, which then decays to stable °°Fe 
with T = 110 days. These decays produce MeV gamma rays and 
positrons that power the optical light curve. Indeed, a plot of lumi- 
nosity versus time directly shows the two exponential components. 


e Core-collapse (Type II/Ib/Ic) supernovae 

These have progenitors with m ~ 8—40, and live for less than ~ 0.1 
Gyr. Importantly, the dynamics depend only on single stars, and 
not whether they happen to be in binaries or not. As you know, the 
source of stellar energy is nuclear fusion reactions, which burn light 
elements into progressively heavier ones, until elements near iron 
are reached, and the reactions stop being exothermic. Until that 
point, as each nuclear fuel is exhausted, the star contracts until 
the core is hot and dense enough to ignite the next one (remember, 
these reactions are suppressed by the Coulomb barrier). The cutoff 
of m ~ 8 denotes the requirement of being able to burn all the 
way up to iron. So what happens at that point? Once there is a 
m ~ 1.4 iron core, it is no longer generating nuclear energy, but 
it could support itself by electron degeneracy pressure, except for 
the fact that the massive envelope of the star is weighing down on 
it. As discussed below, this leads to the collapse of the core and 
the formation of a hot and dense proto-neutron star, which cools 
primarily by neutrino emission over a timescale of seconds. 
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In both cases, a tremendous amount of energy is released in a time that 
is very short compared to the lifetimes of stars, the resulting optical displays 
are crudely similar, and shell remnants are left behind. For thermonuclear 
supernovae, the source of the energy is nuclear fusion reactions, primarily 
revealed by the gamma rays from nuclear decays. The neutrino emission 
is subdominant, and no compact remnant is left behind. For core-collapse 
supernovae (often referred to as type-II supernovae, even when this is in- 
clusive of types Ib and Ic as well), the source of the energy is gravitational, 
and is primarily revealed by the neutrinos emitted from the newly-formed 
neutron star (which may ultimately become a black hole). There is also 
gamma-ray emission, but it is subdominant compared to the neutrinos. 
Finally, one interesting fact is that for both categories of supernova, the 
explosion energy is about 10°! erg, known as 1 “f.o.e.” (fifty-one erg) or 1 
“Bethe.” Note that this is about 10~?Mc? for 1 solar mass of material. 

The neutrino and gamma-ray emissions from supernovae could in prin- 
ciple be detected from individual objects, or as diffuse glows from all past 
supernovae. Although low-mass stars are much more common than high- 
mass stars, type Ia supernovae are more rare than core-collapse supernovae 
by a factor of several, due to the requirement of being in a suitable binary. 
Before we get into details, here’s where things stand on observations of the 
direct messengers. 


e Gamma rays from thermonuclear supernovae 
These have never been robustly detected from individual objects, 
though in a few cases the COMPTEL instrument set interesting 
limits. While a diffuse background of gamma rays is seen in the 
MeV range (and beyond), it is now thought that supernovae do not 
contribute significantly, making it more of a mystery what does. 


e Neutrinos from core-collapse supernovae 
These have been seen just once, from SN 1987A, but only with 
about 20 events. No diffuse background of neutrinos has been 
seen yet, placing interestingly tight limits on the contribution from 
supernovae. 


For particle physicists, the primary interest is on two points. If neutrinos 
have unexpected properties, or if there are new light particles that effec- 
tively carry away energy, then the neutrino emission per supernova could 
be altered. If there are processes in the universe that produce MeV gamma 
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rays, directly or after redshifting, e.g., dark matter decay, then these may 
explain the observed gamma-ray background. 

Now let’s turn to the basics of the neutrino emission from core-collapse 
supernovae. The gravitational binding energy release can be simply es- 
timated. The gravitational self-energy of a constant-density sphere is 
(3/5)Gy M?2/R, and so 


3GvMig  3GnMig 
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~ 3x 10°% erg ~ 2 x 105° MeV, 

using the observed facts that neutron stars have masses of about m = 1.4 
and radii of about 10 km. Note that the second term in the difference is 
negligible. This is a tremendous amount of energy, trapped inside a very 
dense object, and so no particles can escape and carry away energy except 
neutrinos. In fact, even the neutrinos must diffuse out, as the density is 
high enough to counteract the smallness of their interaction cross sections. 

The core collapses until it reaches near-nuclear densities, at which point 
it cannot proceed further, and hitting this wall creates an outgoing shock. If 
successful, the shock will propagate though the envelope of the star, lifting 
it off and creating the optical supernova. If not, it will stall, and then the 
inflow of further material will lead to black hole formation and no optical 
supernova. 

The neutrinos are emitted from the core, within seconds of the collapse, 
and carry nearly the full binding energy release noted above. It takes 
perhaps hours or days for the shock to break through the envelope and 
begin the optical supernova, which is then bright for months. Importantly, 
the neutrinos are received before the light. It’s not that they are tachyons, 
but rather just that they were emitted first. The kinetic energy of the 
supernova ejecta is only ~ 1% of the total energy, and the energy in the 
optical emission is even less. The neutrinos are the most interesting, since 
they carry most of the energy, are emitted in the shortest and earliest time, 
and come from the densest regions. Other than gravitational waves, which 
have yet to be observed, only neutrinos can reveal the inner dynamics of 
the core collapse process. 

As noted, the neutrinos diffuse through the proto-neutron star, mean- 
ing that they leave on a longer timescale and with lower energies than they 
would if it were less dense. It is typically assumed that the neutrino emis- 
sion per flavor (all six, counting neutrinos and antineutrinos) is comparable. 
That is, each takes about 1/6 of the binding energy, and has thermal spec- 
tral with average energies of 10-20 MeV. There is a vast literature about 
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Fig. 3.1. Scatterplots of the neutrino events associated with SN1987A, as seen in the 
Kamiokande [2], IMB [3], and Baksan [4] detectors. The shaded regions indicate the 
nominal detector energy thresholds. Figure taken from Ref. [5]. 


the differences between flavors, and using this to test neutrino mixing, but 
this is beyond our scope. 

The SN 1987A data are shown in Fig. 3.1. These are consistent with 
mostly being signal events due to inverse beta decay on free protons, De + 
p— et +n. This reaction channel is special due to its large cross section, 
and the fact that the outgoing positron carries nearly the full antineutrino 
energy. The other flavors are much harder to detect. The first thing to 
notice is that the duration of the burst was about 10 seconds. The second 
is that the typical energies were low tens of MeV. (This is complicated 
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somewhat by the nontrivial response function of the detectors, especially 
IMB, which was only effective at the highest energies.) At zeroth order, the 
Kamiokande and IMB data are consistent with each other and theoretical 
expectations. The Baksan data are quite puzzling, as this detector was 
about ten times smaller than Kamiokande, and thus they should have seen 
~ 1 event; probably detector backgrounds were present. 

The most important message is that these data are consistent with the 
picture of slow diffusion out of a very hot and dense object, i.e., with the 
birth of a neutron star, as is suggested also by the total energetics, assuming 
a comparable neutrino emission per flavor. You can easily estimate the 
number of detected events yourself, using the total energy noted above, 
the inverse beta cross section, and the distance of 50 kpc. (Interestingly, 
there is still no good astronomical evidence for such a compact object in 
the SN 1987A remnant.) This kind of basic confirmation of the explosion 
mechanism is what can do with such a small number of neutrino events. 

How can we gather more supernova neutrinos? There are three possi- 
bilities. First, Milky Way objects, with D ~ 10 kpc. Taking into account 
the fact that we have much larger detectors now, and assuming a typical 
distance in the Milky Way, we expect about 104 detected events in Super- 
Kamiokande. Unfortunately, the frequency is probably only 2 or 3 times per 
century, but we might get lucky. It will be very obvious if it happens. Sec- 
ond, Nearby objects with D ~ 10 Mpc or less. For these, one would need 
a much larger detector, at the 1 Mton scale, and the number of detected 
events per supenova is ~ 1. On the other hand, the frequency is about 
once per year. To reduce backgrounds, this would require a coincident de- 
tection of say two or more neutrinos, or one neutrino and the optical signal. 
Third, Distant objects from redshifts z ~ 1-2 or less. As a crude guide to 
how this works, imagine a supernova at a distance such that the expected 
number of detected events in Super-Kamiokande is 1078. Almost all of the 
time, nothing happens, but for one supernova in a million, one neutrino 
will be detected. This seems crazy until you realize that the supernova rate 
of the universe is a few per second. Putting this together more carefully 
leads to an expectation of several detected supernova events per year in 
Super-Kamiokande (these will be uncorrelated with the optical supernovae, 
due to the nearly isotropic nature of the detection cross section). A strong 
rejection of detector backgrounds is required to make this work. 

Of these three detection modes, I’ll focus on the last, as it is the least 
familiar. 
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3.2.3. Supernovae in the Milky Way 


At present, the flagship supernova neutrino detector is Super-Kamiokande, 
which is located in a deep mine in Japan. It is the largest detector with 
the ability to separate individual supernova neutrino events from detector 
backgrounds. Its huge fiducial volume contains 22.5 kton of ultrapure water. 
Relativistic charged particles in a material emit optical Cerenkov radiation, 
which is detected by photomultiplier tubes around the periphery. 

With ~ 104 events detected for a Milky Way supernova, the Super- 
Kamiokande data could be used to map out the details of the neutrino 
spectrum and luminosity profile. Additionally, other neutrino detection 
reactions, for which the yields are at the 1-10% level in comparison to 
inverse beta decay, would become important, revealing more about the 
flavors besides Pe. The aspects of detecting a Milky Way supernova are 
very interesting, and have been extensively discussed elsewhere. 


3.2.4. Supernovae in Nearby Galaxies 


If Super-Kamiokande can detect 104 events at a supernova distance of 10 
kpc, then it can expect to detect 1 event for a supernova distance of 1 
Mpc, somewhat larger than the distance to the M31 (Andromeda) and 
M33 (Triangulum) galaxies. Unfortunately, a single event isn’t exciting by 
itself, and anyway, these galaxies appear to have even lower supernova rates 
than the Milky Way. Still, it makes one wonder about greater distances. 
The number of galaxies in each new radial shell in distance increases like 
D?, while the flux of each falls like 1/D?. As mentioned, one can beat even 
small Poisson expectations with enough tries, so this is intriguing. 

An estimate based on the known nearby galaxies shows that the super- 
nova rate with 10 Mpc should be about one per year, and this is shown in 
Fig. 3.2. In fact, the observed rates in the past few years have been even 
higher. A detailed calculation shows that a larger detector than Super- 
Kamiokande, something more on the 1 Mton scale, could detect about one 
supernova neutrino per year. (Such detectors are being considered for pro- 
ton decay studies and as targets for long-baseline neutrino beams.) That 
seems like a small rate, but bear in mind that in the twenty years since SN 
1987A, exactly zero supernova neutrinos have been (identifiably) detected. 
To reduce backgrounds, these nearby supernovae would need a coincidence 
of at least two neutrinos or one neutrino and the optical signal. Perhaps 
most importantly, the detection of even a single neutrino would fix the 
start time of the collapse to about ten seconds, compared to the precision 
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Fig. 3.2. The predicted cumulative supernova rate for nearby galaxies is shown by the 
blue line, and its uncertainty by the grey band (together denoted as “Galaxy Catalog”). 
The redshift z = 0 limit of the cosmic supernova rate is also shown (“Continuum Limit”). 
The observed local supernova rate in recent years has been higher than either prediction. 
Figure taken from Ref. [6]. 


of about one day that might be determined from the optical signal. This 
would be very useful for refining the window in which to look for a faint 
gravitational wave signal. 

Related to this is an effort called NO SWEAT (Neutrino-Oriented Su- 
pernova Whole-Earth Telescope), led by Avishay Gal-Yam, to use a network 
of telescopes worldwide to find all supernovae in nearby galaxies. 


3.2.5. DSNB: First Good Limit 


The star formation rate was larger in the past, and in particular, was about 
10 times larger at redshift z ~ 1 than it is today. Since the lifetimes 
of massive stars are short, the core-collapse supernova rate should closely 
follow the evolution of the star formation rate, up to a constant factor. This 
gives more weight to distant supernovae than if the rate were constant. On 
the other hand, for supernova beyond z ~ 1, the neutrinos are so redshifted 
that their detection probabilities are too low (at lower energy, the detection 
cross section goes down while the detector background rates increase). 
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Fig. 3.3. The event spectrum measured in Super-Kamiokande is denoted by the points 
with error bars. The solid line indicates the expected total detector background rate 
(the dotted component is due to muon neutrinos, and the dot-dashed component is due 
to electron neutrinos). The dashed line above the solid line indicates how large of an 
excess due to DSNB events could be present, given the statistical uncertainties. Figure 
taken from Ref. [7]. 


Integrating the neutrino emission per supernova with the evolving su- 
pernova rate, and taking into account the cosmological factors, the accu- 
mulated spectrum of all past supernovae can be calculated. This is known 
as the Diffuse Supernova Neutrino Background (DSNB), or sometimes as 
relic supernova neutrinos (which is a confusing and deprecated term, i.e., 
these have nothing to do with the 2 K relic background of neutrinos that 
decoupled just before big-bang nucleosynthesis). 

In 2000, a paper by Kaplinghat, Steigman, and Walker calculated the 
largest plausible DSNB flux, and found it to be 2.2 cm7? s~! for electron 
antineutrinos above 19.3 MeV. This was about 100 times smaller than the 
existing limit from Kamiokande, so the prospects for detection didn’t look 
great. Other calculations with reasonable inputs (by modern standards) 
gave results that were a few to several times smaller. 

In 2003, the Super-Kamiokande collaboration published a limit that was 
1.2 in the above units. This was a milestone, because it showed for the first 
time that there was hope of reaching the range in which a detection might be 
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made. Still, as shown in Fig. 3.3, there are large detector backgrounds that 
make it difficult to identify the DSNB signal. Note that for a background- 
limited search, like this one, to improve the signal sensitivity by a factor of 
3 takes a factor 9 more statistics. Since this figure was based on 4 years of 
data, this would take a long time to collect (comparable to the wait for a 
Galactic supernova!). 


3.2.6. DSNB: Detection with Gadolinium 


In order to make progress, it is necessary to find a way to eliminate or at 
least severely reduce the detector background. Mark Vagins (a member 
of the Super-Kamiokande collaboration) and I decided to put our heads 
together to find a way to isolate the DSNB signal. This resulted in a 2004 
article in Physical Review Letters, though we were forced to remove the 
code name of the project, “GADZOOKS!,” from the title and text (but see 
the arXiv version). Recall that the detection reaction is Pe + p > et +n, 
and that at present, only the positron is detected. We realized that the key 
was to detect the neutron in time and space coincidence with the positron. 
This is an old idea, and was used by Reines and Cowan in the first detection 
of neutrinos (antineutrinos from a nuclear reactor). 

Saying that we had to detect the neutron was the easy part. It was 
more challenging to find a way to do this in a water-based detector, where 
normally the neutrons capture on free protons. That produces a 2.2 MeV 
gamma ray that Compton scatters electrons, but they are too low in energy 
to be detectable. We pointed out that the required neutron tagging might 
be possible by using a 0.2% admixture of dissolved gadolinium trichloride 
(GdCls). Gadolinium has a huge neutron capture cross section, and pro- 
duces an 8 MeV gamma-ray cascade that reconstructs as an equivalent 
single electron of about 5 MeV, which is readily detectable. 

The really hard part was in establishing that this technique might be 
possible in practice, which involved raising and answering many difficult 
technical questions. (Among them, finding a suitable water-soluble com- 
pound of gadolinium.) Somewhat to our surprise, we found no obvious 
obstacles. Mark Vagins has been leading a detailed research and develop- 
ment effort, and so far, the prospects look very good. 

In Fig. 3.4, the spectra expected in Super-Kamiokande if gadolinium is 
added are shown. The atmospheric neutrino backgrounds mentioned above 
are reduced by a factor of about 5. Additionally, backgrounds at lower 
energies are severely reduced, allowing the use of a much lower threshold 
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Fig. 3.4. The DSNB signal and detector backgrounds expected in Super-Kamiokande 
if gadolinium is added. Figure taken from Ref. [8]. 


energy. At moderate energies, it should be possible to cleanly identify 
DSNB signal events. 


3.2.7. DSNB: Astrophysical Impact 


Now let’s return to the predicted DSNB spectrum. If either the assumed 
star formation rate or the neutrino emission per supernova were too large, 
then the predicted DSNB flux would already be ruled out the the Super- 
Kamiokande data. 

Even since the time of the Super-Kamiokande limit, the astrophysical 
data have improved substantially. Andrew Hopkins and I synthesized a 
wide variety of data to constrain the star formation and supernova rate 
histories. An example fit is shown in Fig. 3.5. The uncertainty band is much 
more narrow now than it was just a few years ago. The normalization of 
the cosmic star formation rate depends on dust corrections. If the true star 
formation rate were even somewhat larger than determined here, then the 
DSNB neutrino flux would be too large relative to the Super-Kamiokande 
limit. The only way out would be to require a substantially lower neutrino 
emission per supernova. 
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Fig. 3.5. The star formation rate history, with selected data shown by points and the 
fit and uncertainty shown by the bands. Figure taken from Ref. [9]. 


The corresponding calculated supernova rates are in good agreement 
with data. As an interesting aside, it was shown that the diffuse gamma- 
ray background from type Ia supernovae is too small to account for the 
observed data in the MeV range. That is particularly significant because 
many limits on exotic particle physics depend on just this energy range. 


3.2.8. Back to the Scene of the Crime: SN 1987A 


If we now know the star formation history, then the only remaining un- 
known is the neutrino emission per supernova. Hasan Yiiksel, Shin’ichiro 
Ando, and I considered how well the Super-Kamiokande data already re- 
strict the neutrino emission per supernova. The emission models are usually 
parametrized in terms of the time-integrated luminosity (or portion of the 
binding energy release) and the average energy per neutrino (related to the 
temperature of the spectrum). I mentioned above that the Kamiokande 
and IMB data on the emission from SN 1987A were mostly consistent. In 
fact, when fitted with thermal spectra, there are some discrepancies. 
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Fig. 3.6. The contours labeled Kam-II and IMB are the allowed regions from the SN 
1987A data, assuming a thermal spectrum [10]. The shaded region is what is already 
excluded by the non-observation of a DSNB signal in Super-Kamiokande. Figure taken 
from Ref. [11]. 


In Fig. 3.6, I show that the DSNB data are probing neutrino emission 
parameters only slightly larger than those deduced from the SN 1987A data. 
With reduced detector backgrounds, the DSNB spectrum would be a new 
way to measure the neutrino emission per supernova. 


3.2.9. Conclusions 


Why is understanding supernovae interesting and important? For particle 
physics, it is to test the properties of neutrinos, and to search for new 
low-mass particles that cool the proto-neutron star. For nuclear physics, 
it is to constrain the neutron star equation of state and to shed light on 
the formation of the elements. For astrophysics, it is to understand the 
stellar life and death cycles and to understand the supernova mechanisms. 
For cosmology, it is to better understand the details of whether type Ia 
supernovae are standard candles, and to probe the origins of the gamma- 
ray and neutrino backgrounds. With more data, we can’t lose. 
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3.3. PART TWO: High-Energy Neutrinos 


3.3.1. Introduction 


Now that we’ve covered the specific example of supernova neutrinos, let’s 
step back and comment on the general status and outlook in neutrino 
astrophysics. 

Unique among the Standard Model fermions, neutrinos are neutral, and 
more generally, have only weak interactions. This makes them potentially 
sensitive to even very feeble postulated new interactions. While the discov- 
ery of neutrino mass and mixing was “new physics” beyond the minimal 
Standard Model, the discovery of any new interactions would be a much 
more radical step, as it would require new particles as well. 

This is one reason that we’re interested in neutrinos. The other, already 
discussed, is that they will be especially powerful probes of astrophysical 
objects, once these neutrinos are detected. Already with the neutrinos 
from the Sun and SN 1987A, the scientific return was very rich: not only 
confirmation of the physics of their interiors, but also a crucial piece in the 
discovery of neutrino mass and mixing. Ray Davis and Masatoshi Koshiba 
shared in the 2002 Nobel Prize for this work, and their citation reads, “...for 
pioneering contributions to astrophysics, in particular for the detection of 
cosmic neutrinos....” 

The general achievements in neutrino physics in just the recent past 
might be summarized as follows. The cosmological results are the con- 
sistency of big-bang nucleosynthesis yields with three flavors of neutrinos, 
and the exclusion of neutrinos as the (hot) dark matter. In both cases, 
these facts have been established independently in the laboratory as well. 
The astrophysical results are the discovery of neutrinos from SN 1987A 
and the solution of the solar neutrino problem. The fundamental results 
are the discovery of neutrino mass and mixing, and the clear exclusion of a 
huge range of formerly allowed models of exotic neutrino properties. 

One of the lessons from this list is that we need data from new sources 
to make new discoveries, and that those discoveries may have a broader 
impact than initially thought. Astrophysical sources reach extremes of 
density, distance, and energy, and this will allow unprecedented tests of 
neutrino properties, for example. 

We can identify three frontiers where new sources will likely be discov- 
ered soon. By the rough energy scale of the neutrinos, we might call these 
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the MeV (107° TeV) scale, the TeV scale, and the EeV (10° TeV) scale. 
At the MeV scale, the focus is on the Visible Universe, i.e., stars and su- 
pernovae, and Super-Kamiokande is the main detector. At the TeV scale, 
the focus is on the Nonthermal Universe, i.e., jets powered by black 
holes, and the primary detector is AMANDA, which is being succeeded by 
IceCube. At the EeV scale, the focus is on the Extreme Universe, i.e., 
at the energy frontier of the highest-energy cosmic rays, and one of the key 
detectors is ANITA. 

Why do we think that high-energy neutrinos even exist? First, because 
cosmic rays (probably mostly protons) are observed at energies as high as 
10% eV, and they are increasingly abundant down to at least the GeV 
range. Something is accelerating these cosmic rays, and it is very likely 
that these sources also produce neutrinos. Second, because extragalactic 
gamma-ray sources have been observed with energies up to about 10 TeV 
(and galactic sources up to about 100 TeV). Again, something is producing 
these particles, and in large fluxes, and it is likely that neutrinos are also 
produced. 

So then why do we need neutrinos? The problem with cosmic rays is that 
they are easily deflected by magnetic fields, and so only their isotropic flux 
has been observed, making the identification of their sources very difficult. 
The problem with photons is that they are easily attenuated: a TeV gamma 
ray colliding with an eV starlight photon is able to produce an electron- 
positron pair. Thus at high energies, only nearby objects can be seen. 

High-energy neutrinos can be made through either proton-proton or 
proton-photon collisions, depending on energies. In either case, pions are 
readily produced, and typically comparable numbers of neutral and charged 
pions are made. Neutral pions decay as 7° — y+y, and charged pions decay 
as mt — pt + Vp, followed by pt > et + ve + D, (with obvious changes 
for the charge conjugate). This is the hadronic mechanism for producing 
gamma rays and neutrinos. There is also a leptonic mechanism, based 
on the inverse Compton scattering reaction e~ + y — y+ e7, where fast 
electrons collide with low-energy photons and promote them to high-energy 
gamma rays. Note that the leptonic process produces no neutrinos. It is 
a major mystery whether the observed high energy gamma-ray sources are 
powered by the hadronic or leptonic mechanism. This is a key to uncovering 
the sources of the cosmic rays. 
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Fig. 3.7. The dotted line is based on gamma-ray observations of the nearby AGN 
Markarian 501 by the HEGRA experiment, and the shaded band is a calculation that 
removes the assumed affects of attenuation en route. The labeled solid lines indicate 
AMANDA limits on the neutrino flux. This object flares, and the gamma ray and 
neutrino data are not contemporaneous. Figure taken from Ref. [12]. 


3.3.2. Sources and Detection at ~ 1 TeV 


At the simplest level, hadronic sources produce nearly equal fluxes of 
gamma rays and neutrinos (the corrections due to multiplicities, decay en- 
ergies, and neutrino mixing can be easily taken into account). Therefore, 
the observed gamma-ray spectrum of an object like an AGN is a strong pre- 
dictor of the neutrino spectrum, if the source is hadronic (if it is leptonic, 
then the neutrino flux will be zero). Any attenuation of the gamma-ray 
spectrum en route would mean that the neutrino flux would be even larger. 
An example is illustrated in Fig. 3.7, where it is shown that the neutrino 
detectors are now approaching the required level of flux sensitivity. 

For hadronic sources, the initial neutrino flavor ratios (adding neutrinos 
and antineutrinos) are ĝe : ¢, : Ọr = 1 : 2:0, following simply from the 
pion and muon decay chains. After vacuum neutrino mixing en route, these 
will become Qe : ¢,:¢, =1:1:1. 

Of all flavors, the muon neutrinos are the easiest to detect and identify. 
Through charged-current deep inelastic scattering reactions, these produce 
muons that carry most of the neutrino energy, and which have only a very 
small deflection from the neutrino direction. Muons and other charged 
particles produce optical Cerenkov radiation in the detector, which is reg- 
istered by photomultiplier tubes throughout the volume. Muons produce 
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Fig. 3.8. The sensitivity of IceCube is marked with heavy solid lines, as labeled. The 
broken lines indicate various astrophysical diffuse flux models. The shaded regions indi- 
cate the atmospheric neutrino, prompt/charm component thereof, and Galactic neutrino 
backgrounds. Figure taken from Ref. [13]. 


spectacular long tracks that can range through the kilometer of the detec- 
tor and beyond. The detection of electron and tau neutrinos is interesting 
and important too, but beyond our scope here. 

To screen out enormous backgrounds from downgoing atmospheric 
muons, these detectors only look for upgoing events. Since muons can- 
not pass through Earth, these muons must have been created just below 
the detector by upgoing neutrinos. Even after this, there are backgrounds 
due to atmospheric neutrinos, themselves produced on the other side of 
Earth, and thus hardly extraterrestrial. 

An astrophysical point source can be identified as an excess in a given 
direction, whereas the atmospheric neutrino background is smoothly vary- 
ing. Transient point sources are even easier to recognize. On the other 
hand, diffuse astrophysical neutrino fluxes are quite hard to separate from 
the atmospheric neutrino background. The principal technique is that the 
former are believed to have spectra close to E~*, while the latter is closer 
to E~%, and steeper at higher energies. Thus at high energies the astro- 
physical diffuse fluxes should emerge as dominant. Once cannot go too 
high in energy — the event rates get too low, and Earth becomes opaque to 
neutrinos at around 100 TeV. An example of the diffuse flux sensitivity of 
IceCube is shown in Fig. 3.8. 
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3.3.3. Testing Neutrino Properties 


As an example of a novel neutrino property that could be tested once as- 
trophysical sources are observed, consider neutrino decay. Why should 
neutrinos decay? Other than the fact that there is no interaction that can 
cause fast neutrino decay, why shouldn’t they decay? The other massive 
fermions all decay into the lowest-mass generation in their family. (Neu- 
trinos can too, via the weak interaction, but it is exceedingly slow.) We’ll 
consider simply neutrino disappearance, i.e., that the other particle in the 
decay of one neutrino mass eigenstate to another is too weakly interacting 
to be detected. It is quite hard to test for the effects of such decays. 
Decay will deplete the original flux as 


m 


exp (—t/Tiab) = exp (-3 x z) ; 
E rT 
where L is distance, E the energy, m the mass, and 7 the proper lifetime. 
For the Sun, the T/m scale that can be probed is up to about 1074 s/eV. 
On the other hand, for distant astrophysical sources of TeV neutrinos, L/E 
may be such that T/m up to about 10** s/eV is relevant! 

How can we tell if decay has occurred, if the neutrino fluxes are uncer- 
tain? As mentioned, the flavor ratios after vacuum oscillations are expected 
to be ġe : Gy : Qr = 1:1: 1. However, it is among the mass eigenstates, not 
the flavor eigenstates, where decays take place. Suppose that the heaviest 
two mass eigenstates have decayed, leaving only the lightest mass eigen- 
state. What is its flavor composition? In the normal hierarchy, it has flavor 
ratios Qe : by : r ~ 5: 1: 1, whereas in the inverted hierarchy, they 
are ~ 0: 1: 1. In either case, they are quite distinct from the no-decay 
case, and the flavor identification capabilities of IceCube should be able to 
distinguish these possibilities. 


3.3.4. Sources and Detection at ~ 10® TeV 


Cosmic rays have been observed at energies above 107° eV, and there are 
no good answers as to what astrophysical accelerators may have produced 
them. However, this becomes even more puzzling when it is noted that the 
universe should be opaque to protons above about 3x 101° eV traveling over 
more than 100 Mpc. There are no obvious sources within that distance. 
The process by which protons are attenuated is p+ y > p+r, n+ rt, 
where both final states are possible, and the target photon is from the 


cosmic microwave background. As with the hadronic processes discussed 


120 John F. Beacom 


[a T T T T T T T T T T T T T T T 
107 E Limits: | ~— 7-7 AMANDA-coscades 
E ©&—Ə RICE-2003 


108 Ly ANITA-lite (this work) 
jo Frew P— GLUE-2003 

#—+ FORTE-2004 
104 E OO ANITA- (projected) 


0.01 S TD:M,= 1e15GeV 
aana TD:Mirror 


logy(dF/d(logE), km=? yr-' ster-') 


—-—- Zburst! 


+ Zburst2 


8 10 12 14 
log,,(energy E, GeV) 


Petre o Me TT TTT d siod mer MT MTT MTT o iunt 


S 
pm r 


Fig. 3.9. Real and projected neutrino flux sensitivities of various experiments (lines 
with points), along with various models (as labeled). Figure taken from Ref. [14]. 


above, the neutral pion decays produce gamma rays and the charged pion 
decays produce neutrinos. The gamma rays are themselves attenuated, but 
the neutrino flux builds up when integrating over sources everywhere in the 
universe. Since the attenuation process for the protons is called the GZK 
process (Greisen-Zatsepin-Kuzmin), these are called GZK neutrinos. Typi- 
cal energies are in the EeV range, and an isotropic diffuse flux is expected. 

New experiments are being deployed to search for the GZK neutrino 
flux, as shown in Fig. 3.9. Unlike IceCube, which is based on optical 
Čerenkov radiation, ANITA and other experiments are based on radio 
Čerenkov radiation that is emitted coherently from the whole shower ini- 
tiated by a neutrino in the ice or other transparent medium. ANITA is 
using the Antarctic ice cap as the detector, and is observing it with radio 
antennas mounted on a balloon. So far, detector backgrounds appear to be 
negligible, meaning that it should be straightforward to improve the signal 
sensitivity with more exposure. 

In Fig. 3.10, I show the results of a very recent calculation of the ex- 
pected GZK neutrino fluxes. 
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Fig. 3.10. Predicted GZK neutrino fluxes, assuming that ultrahigh energy cosmic rays 
are produced in gamma-ray bursts, and according to how the latter rate evolves with 
redshift (i.e., following the star formation rate alone, or rising like that of the quasars, or 
depending on both the star formation rate and the evolving local metallicity. The “WB” 
band is the Waxman-Bahcall bound. The curves with points are projected sensitivities 
for ANITA (upper) and ARIANNA (lower). Figure taken from Ref. [15]. 


Interestingly, when adjusted for the neutrino-quark center of mass en- 
ergy, the detection reactions are probing above the TeV scale, opening the 
prospect of sensitivity to new physics in the detection alone. 


3.3.5. Conclusions 


So far, zero high-energy astrophysical neutrinos have been detected. How- 
ever, the near-term prospects are very good, and are strongly motivated by 
measured data on high-energy protons and photons. Still, this will not be 
easy, and large detectors with strong background rejection will be needed. 
If successful, these experiments will make important astrophysical discover- 
ies, e.g., whether gamma-ray sources are based on the hadronic or leptonic 
mechanisms, the origins of cosmic rays at all energies, etc. We might even 
learn something new about neutrinos in the process! 
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The origin of the asymmetry between matter and anti-matter of the 
Universe has been one of the great challenges in particle physics and 
cosmology. Leptogenesis as a mechanism for generating the cosmolog- 
ical baryon asymmetry of the Universe has gained significant interests 
ever since the advent of the evidence of non-zero neutrino masses. In 
these lectures presented at TASI 2006, I review various realizations of 
leptogenesis and allude to recent developments in this subject. 


4.1. Introduction 


The understanding of the origin of the cosmological baryon asymmetry 
has been a challenge for both particle physics and cosmology. In an ex- 
panding Universe, which leads to departure from thermal equilibrium, a 
baryon asymmetry can be generated dynamically by charge-conjugation 
(C), charge-parity (CP) and baryon (B) number violating interactions 
among quarks and leptons. Possible realizations of these conditions have 
been studied for decades, starting with detailed investigation in the context 
of grand unified theories. The recent advent of the evidence of non-zero neu- 
trino masses has led to a significant amount of work in leptogenesis. This 
subject is of special interests because the baryon asymmetry in this sce- 
nario is in principle entirely determined by the properties of the neutrinos. 


* Address after January 1, 2007. 
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Fig. 4.1. The power spectrum anisotropies defined in Eqs. 4.2 and 4.3 as a function of 
the multiple moment, l. Figure taken from Ref. [2]. 


In these lectures, I discuss some basic ingredients of leptogenesis as well as 
recent developments in this subject. 

These lectures are organized as follows: In Sec. 4.1, I review the basic in- 
gredients needed for the generation of baryon asymmetry and describe var- 
ious mechanisms for baryogenesis and the problems in these mechanisms. 
In Sec. 4.2, I introduce the standard leptogenesis and Dirac leptogenesis as 
well as the problem of gravitino over-production that exists in these stan- 
dard scenarios when supersymmetry is incorporated. This is followed by 
Sec. 4.3, in which several alternative mechanisms that have been invented to 
alleviate the gravitino over-production problem are discussed. Section 4.4 
focuses on the subject of connecting leptogenesis with low energy leptonic 
CP violating processes. Section 4.5 concludes these lectures with discus- 
sions on the recent developments. For exiting reviews on the subject of 
leptogenesis and on baryogenesis in general, see e.g. Refs. [1-4] and [5-7]. 


4.1.1. Evidence of Baryon Number Asymmetry 


One of the main successes of the standard early Universe cosmology is the 
predictions for the abundances of the light elements, D, 7He, He and “Li. 
(For a review, see, Ref. [8]. See also Scott Dodelson’s lectures.) Agreement 
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between theory and observation is obtained for a certain range of parameter, 
npg, which is the ratio of the baryon number density, ng, to photon density, 
Ny, 


pen = 28 — (2.6 — 6.2) x 1072. (4.1) 
Voy 
The Cosmic Microwave Background (CMB) is not a perfectly isotropic ra- 
diation bath. These small temperature anisotropies are usually analyzed by 
decomposing the signal into spherical harmonics, in terms of the spherical 
polar angles 0 and ¢ on the sky, as 


AT =D ain ¥in(9,9) (4.2) 
l,m 


where am are the expansion coefficients. The CMB power spectrum is 
defined by 


Ch = (\aim|?) ; (4.3) 


and it is conventional to plot the quantity I(l + 1)C; against |. The CMB 
measurements indicate that the temperature of the Universe at present is 
Tnow ~ 3°K. Due to the Bose-Einstein statistics, the number density of the 
photon, n4, scales as T’. Together, these give a photon number density at 
present to be roughly 400/cm?. It is more difficult to count the baryon num- 
ber density, because only some fraction of the baryons form stars and other 
luminous objects. There are two indirect probes that point to the same 
baryon density. The measurement of CMB anisotropies probe the acoustic 
oscillations of the baryon/photon fluid, which happened around photon last 
scattering. Figure 4.1 illustrates how the amount of anisotropies depends 
on ng/ny. The baryon number density, ng ~ 1/m?, is obtained from the 
anisotropic in CMB, which indicates the baryon density Qg to be 0.044. 
Another indirect probe is the Big Bang Nucleosynthesis (BBN), whose pre- 
dictions depend on ng/ny through the processes shown in Fig. 4.2. It is 
measured independently from the primordial nucleosynthesis of the light 
elements. The value for ng/ny deduced from primordial Deuterium abun- 
dance agrees with that obtained by WMAP [9]. For He and ‘Li, there are 
nevertheless discrepancies which may be due to the under-estimated errors. 
Combining WMAP measurement and the Deuterium abundance gives, 


— = ņg = (6.1 + 0.3) x 107°. (4.4) 
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4.1.2. Sakharov’s Conditions 


A matter-anti-matter asymmetry can be dynamically generated in an ex- 
panding Universe if the particle interactions and the cosmological evolution 
satisfy the three Sakharov’s conditions [10]: (2) baryon number violation; 
(ii) C and CP violation; (iii) departure from thermal equilibrium. 


4.1.2.1. Baryon Number Violation 


As we start from a baryon symmetric Universe (B = 0), to evolve to a 
Universe where B 4 0, baryon number violation is necessary. Baryon num- 
ber violation occurs naturally in Grand Unified Theories (GUT), because 
quarks and leptons are unified in the same irreducible representations. It 
is thus possible to have gauge bosons and scalars mediating interactions 
among fermions having different baryon numbers. In the SM, on the other 
hand, the baryon number and the lepton number are accidental symme- 
tries. It is thus not possible to violate these symmetries at the tree level. 
t’Hooft realized that [11] the non-perturbative instanton effects may give 
rise to processes that violate (B + L), but conserve (B — L). Classically, B 
and L are conserved, 


B= f tat L= J Perto) l (4.5) 


3He n > 4He y 
3He D + 4He p 


Dp — 3Hey 
DD —3Hen 


Fig. 4.2. Main reactions that determine the primordial abundances of the light elements. 
Figure taken from Ref. [2]. 
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Table 4.1. Standard model fermions and their B and L 


charges. 
v 
qL = ( ) us dt LL = ( es 
a}, es, * 
B 1/3 -1/3 -1/3 0 0 
L 0 0 0 1 -1 


where the currents associated with B and L are given by, 


1 = =C c qc C 
JF ~3 5 (ruar: — UL; MeL, T didi, ) ’ (4.6) 


uv 


JE = SS (Gaul: — Fh, Yueh) - (4.7) 


a 


Here qz refers to the SU(2), doublet quarks, while uz and dz refer to the 
SU(2), singlet quarks. Similarly, Zz refers to the SU(2)z lepton doublets 
and ez refers to the SU (2)z charged lepton singlets. The B and L numbers 
of these fermions are summarized in Table 4.1. The subscript i is the gen- 
eration index. Even though B and L are individually conserved at the tree 
level, the Adler-Bell-Jackiw (ABJ) triangular anomalies [12] nevertheless 
do not vanish, and thus B and L are anomalous [13] at the quantum level 
through the interactions with the electroweak gauge fields in the triangle 
diagrams (see, for example Ref. [14] for details). In other words, the di- 
vergences of the currents associated with B and L do not vanish at the 
quantum level, and they are given by 


N Iy puv Duv 
OTB = OWS, = oe (32w, We — 9? Buy BY ) , (4.8) 


where W,,, and B,, are the SU(2)z and U(1)y field strengths, 
Why = O.W?P — OW? (4.9) 
Bpv = OpBy — WB, , (4.10) 
respectively, with corresponding gauge coupling constants being g and g’, 
and Ny is the number of fermion generations. As oe( JB — JE) = 0, (B-L) 


is conserved. However, (B+ L) is violated with the divergence of the current 
given by, 


OM (I? + JF) =2Nr0,K" , (4.11) 
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where 
g? 
3272 
g”? 
toga By vBig: 


This violation is due to the vacum structure of non-abelian gauge theories. 
Change in B and L numbers are related to change in topological charges, 


Bits) - =f" at f ae ony? (4.13) 
= = Ny[Nes(tp) = Nes(ti)] ; 


K" = —— 997 WP (8, W? + or a) (4.12) 


where the topological charge of the gauge field (i.e. the Chern-Simons 
number) Nes is given by, 


3 . . 
Nalt) = 2 | Prend www, (4.14) 
967? 
There are therefore infinitely many degenerate ground states with 
ANes = EL 2, vas , separated by a potential barrier, as depicted by 


Fig. 4.3. In semi-classical approximation, the probability of tunneling be- 


E[A] 


Esp 


-2 -1 1 2 3 
Nog IA] 


Fig. 4.3. The energy dependence of the gauge configurations A as a function of the 
Chern-Simons number, Nes[A]. Sphalerons correspond to the saddle points, i.e. maxima 
of the potential. 


tween neighboring vacua is determined by the instanton configurations. In 
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SM, as there are three generations of fermions, AB = AL = NsAN¢s = 
+3n, with n being an positive integer. In other words, the vacuum to vac- 
uum transition changes AB and AL by multiples of 3 units. As a result, 
the SU(2) instantons lead to the following effective operator at the lowest 
order, 


OB+L = II (QL:QL:4L:bL:) + (4.15) 
1=1,2,3 


which gives 12 fermion interactions, such as, 
T+d+T— d+2s+2b+ tt ve + Vp +v . (4.16) 


At zero temperature, the transition rate is given by, T ~ e7Simt = 
e47/~ — O(107185) [11]. The resulting transition rate is exponentially 
suppressed and thus it is negligible. In thermal bath, however, things can 
be quite different. It was pointed out by Kuzmin, Rubakov and Shaposh- 
nikov [15] that, in thermal bath, the transitions between different gauge 
vacua can be made not by tunneling but through thermal fluctuations over 
the barrier. When temperatures are larger than the height of the barrier, 
the suppression due to the Boltzmann factor disappear completely, and thus 
the (B+ L) violating processes can occur at a significant rate and they can 
be in equilibrium in the expanding Universe. The transition rate at finite 
temperature in the electroweak theory is determined by the sphaleron con- 
figurations [16], which are static configurations that correspond to unstable 
solutions to the equations of motion. In other words, the sphaleron config- 
urations are saddle points of the field energy of the gauge-Higgs system, as 
depicted in Fig. 4.3. They possess Chern-Simons number equal to 1/2 and 
have energy 

Esp(T) > = (H(T)) , (4.17) 
which is proportional to the Higgs vacuum expectation value (vev), (H(T)), 
at finite temperature T. Below the electroweak phase transition temper- 
ature, T < Tew, (i.e. in the Higgs phase), the transition rate per unit 
volume is [17] 


7 
Vase _ p Mw .-8 p(T). oF (4.18) 


—— i ~eEorT , 


V — (aT) 


where Mw is the mass of the W gauge boson and k is the Boltzmann 
constant. The transition rate is thus still very suppressed. This result can 
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be extrapolated to high temperature symmetric phase. It was found that, 
in the symmetric phase, T > Tw, the transition rate is [18] 


T 
H see ive Tt, (4.19) 
where a is the fine-structure constant. Thus for T > Tgw, baryon number 


violating processes can be unsuppressed and profuse. 


4.1.2.2. C and CP Violation 


To illustrate the point that both C and CP violation are necessary in order 
to have baryogenesis, consider the case [19] in which superheavy X boson 
have baryon number violating interactions as summarized in Table 4.2. The 


Table 4.2. Baryon number violating 
decays of the superheavy X boson in 
the toy model. 


process branching fraction AB 


X > qq a 2/3 
X 7 l-a -1/3 
X > WW Qa -2/3 
X — gf 1-a@ 1/3 


baryon numbers produced by the decays of X and X are, 
2 1 1 


By = a(-5) +(1 -m (3) = -(a- >) ; (4.21) 


respectively. The net baryon number produced by the decays of the X, X 
pair is then, 
e= Bx + B= (a- 0). (4.22) 
If C or CP is conserved, a = @, it then leads to vanishing total baryon 
number, € = 0. 
To be more concrete, consider a toy model [19] which consists of four 


fermions, f1,...4, and two heavy scalar fields, X and Y. The interactions 
among these fields are described by the following Lagrangian, 


L=gXfifit 92X f] fs + gY FÌ fs + 4Y fi fa + h-c., (4.23) 
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where gi,...4 are the coupling constants. The Lagrangian £ leads to the 
following decay processes, 


Xfi the, fat fa; 
Y > f3+ fi, fat f, 


and the tree level diagrams of these decay processes are shown in Fig. 4.4. 
At the tree level, the decay rate of X > fı + fo is, 


T(X > fı + f2) = |x , (4.26) 


where Tx is the phase space factor. For the conjugate process X > fı + fo, 
the decay rate is, 


(4.24) 
(4.25) 


rX > fit fo) = lot Phe. (4.27) 
As the phase space factors Ix and I% are equal, no asymmetry can be 
generated at the tree level. 
At the one-loop level, there are additional diagrams, as shown in Fig. 4.5, 
that have to be taken into account. Including these one-loop contributions, 
the decay rates for X > fı + fo and X —> fı + fa become, 


Fig. 4.4. 


T(X > fı + f2) = ngogsgitxy + c-c. , (4.28) 
T(X > fit fo) = gig293gaIxy + c-c. , (4.29) 
fi fs 
---%_-g, ---%__-<g) 
(a) (b) 
fs fa 
---¥_--g; ee ee 
fi fb 
(c) (d) 


Tree level diagrams for the decays of the heavy scalar fields, X and Y. 
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where c.c. stands for complex conjugation. Now Ixy includes both the 
phase space factors as well as kinematic factors arising from integrating 
over the internal loop momentum due to the exchange of J in I decay. If 
fermions f;,..4 are allowed to propagate on-shell, then the factor Ixy is 
complex. Therefore, 


T(X > fı + fo) -V(X > fi + fe) = 4Im(Ixy )Im(gïg29394). (4.30) 
Similarly, for the decay mode, X > f, + fa, we have, 

[(X > fz fa) -T(X > fs + fa) = —4Im(Ixy )Im(g]g293g4). (4.31) 
Note that, in addition to the one-loop diagrams shown in Fig. 4.5, there are 
also diagrams that involve the same boson as the decaying one. However, 
contributions to the asymmetry from these diagrams vanish as the inter- 


ference term in this case is proportional to Im(gigžgigž) = 0. The total 
baryon number asymmetry due to X decays is thus given by, 


Bsa: (Bı — By)AT(X > fı + fe) + (Ba — Bs)AT(X > Fs + fa) 
Ty ; 


(4.32) 


Fig. 4.5. One loop diagrams for the decays of the heavy scalar fields, X and Y, that 
contribute to the asymmetry. 
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where 


AT(X > fi + fo) =T(X > fi + fe) -P(X > fit fe), (4.33) 
AT(X > fs + fa) =T(X > fs + fa) -T(X > fat fa). (4.34) 
Similar expression can be derived for the Y decays. The total asymmetries 


due to the decays of the superheavy bosons, X and Y, are then given, 
respectively, by 


ex = ln Ixy)ln(g}g29%.90)(Bs — Bs) — (B2 — Bı)] , (4.35) 


ey = “lin (T'cy )ln( 49093 94)((Bo — Ba) — (Bi — Bs)|. (4.36) 


By inspecting Eq. 4.35 and 4.36, it is clear that the following three 
conditions must be satisfied to have a non-zero total asymmetry, € = €x + 
ey: 


e The presence of the two baryon number violating bosons, each of 
which has to have mass greater than the sum of the masses of the 
fermions in the internal loop; 

The coupling constants have to be complex. The C and CP vio- 
lation then arise from the interference between the tree level and 


one-loop diagrams. In general, the asymmetry generated is propor- 
tional to € ~ a”, with n being the number of loops in the lowest 
order diagram that give non-zero asymmetry and a ~ g?/4z ; 

e The heavy particles X and Y must have non-degenerate masses. 
Otherwise, cx = —ey, which leads to vanishing total asymmetry, 
2 


4.1.2.3. Departure from Thermal Equilibrium 


The baryon number B is odd under the C and CP transformations. Using 
this property of B together with the requirement that the Hamiltonian, H, 
commutes with CPT, the third condition can be seen by calculating the 
average of B in equilibrium at temperature T = 1/2, 


< B >r = Tile?" B] = Tr[(CPT)(CPT)~1e-9" B)] (4.37) 
= Tr[e (CPT) B(CPT)| = —Tr[|e 7f” B] . 
In equilibrium, the average < B >r thus vanishes, and there is no genera- 


tion of net baryon number. Different mechanisms for baryogenesis differ in 
the way the departure from thermal equilibrium is realized. There are three 
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possible ways to achieve departure from thermal equilibrium that have been 
utilized in baryogenesis mechanisms: 


e Out-of-equilibrium decay of heavy particles: GUT Baryogenesis, 
Leptogenesis; 

e EW phase transition: EW Baryogenesis; 

e Dynamics of topological defects. 


In leptogenesis, the departure from thermal equilibrium is achieved 
through the out-of-equilibrium decays of heavy particles in an expanding 
Universe. If the decay rate I x of some superheavy particles X with mass 
Mx at the time when they become non-relativistic (i.e. T ~ Mx) is much 
smaller than the expansion rate of the Universe, the X particles cannot 
decay on the time scale of the expansion. The X particles will then remain 
their initial thermal abundance, nx = nx ~ n, ~ T°, for T Z Mx. In 
other words, at some temperature T > Mx, the superheavy particles X 
are so weakly interacting that they cannot catch up with the expansion of 
the Universe. Hence they decouple from the thermal bath while still being 
relativistic. At the time of the decoupling, nx ~ nx ~ T’. Therefore, they 
populate the Universe at T ~ Mx with abundance much larger than their 
abundance in equilibrium. Recall that in equilibrium, 


nx =nx~n, for T2 Mx, (4.38) 
nx =n ~ (MxT)’ Pe Mx/T <n, for TS Mx. (4.39) 


This over-abundance at temperature below Mx, as shown in Fig. 4.6, is 
the departure from thermal equilibrium needed to produce a final non- 
vanishing baryon asymmetry, when the heavy states, X, undergo B and 
CP violating decays. The scale of rates of these decay processes involving 
X and X relative to the expansion rate of the Universe is determined by 
Mx, 


E xX = f (4.40) 
The out-of-equilibrium condition, IT < H, thus requires very heavy states: 
for gauge bosons, Mx > (10!°~'®) GeV; for scalars, Mx = (1010716) GeV, 
assuming these heavy particles decay through renormalizable operators. 
Precise computation of the abundance is carried out by solving the Boltz- 
mann equations (more details in Sec. 4.2.1.2). 
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4.1.3. Relating Baryon and Lepton Asymmetries 


One more ingredient that is needed for leptogenesis is to relate lepton num- 
ber asymmetry to the baryon number asymmetry, at the high temperature, 
symmetric phase of the SM [1]. In a weakly coupled plasma with temper- 
ature T and volume V, a chemical potential u; can be assigned to each of 
the quark, lepton and Higgs fields, i. There are therefore 5Ny + 1 chem- 
ical potentials in the SM with one Higgs doublet and Ny generations of 
fermions. The corresponding partition function is given by, 


Z(p,T, V) = Tr[e E: niQ:)] (4.41) 
where 3 = 1/T, H is the Hamiltonian and Q; is the charge operator for 


the corresponding field. The asymmetry in particle and antiparticle num- 
ber densities is given by the derivative of the thermal-dynamical potential, 


nx/ny 


e 
ilihri deprture from thermal 
> 
Equilibrium equilibrium manifest 


in the over-production 
é of the X particles 


0.01 0.1 1 10 
In(Mx/T) 


Fig. 4.6. The distribution of the X particles in thermal equilibrium (blue curve) follows 
Eq. 4.38 and 4.39. When departure from the thermal equilibrium occurs, the distribution 
of the X particles remains the same as the thermal distribution (red dashed curve). 
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Q(u,T), as 
O(n, T 
where Q(u, T) is defined as, 
T 
O(n, T) = -7 ln Z(u,T, V). (4.43) 


For a non-interacting gas of massless particles, assuming Gu; < 1, 


= _1 maf Bui t O((Bpi)?), fermions 
Bes ce est te + O((Bp:)3), bosons . 


In the high temperature plasma, quarks, leptons and Higgs interact via 


(4.44) 


the guage and Yukawa couplings. In addition, there are non-perturbative 
sphaleron processes. All these processes give rise to constraints among 
various chemical potentials in thermal equilibrium. These include [1]: 


(1) The effective 12-fermion interactions Og+z induced by the sphalerons 
give rise to the following relation, 


X (Gua + ue.) = 0. (4.45) 


a 


— 
N 
So 


The SU(3) QCD instanton processes lead to interactions between LH 
and RH quarks. These interactions are described by the operator, 
I], (42.492; uk, dR, ). When in equilibrium, they lead to, 


Sita: — tas — Ha) = 0. (4.46) 


a 


(3) Total hypercharge of the plasma has to vanish at all temperatures. This 
gives, 


2 
DA @ + Qiu, — Md; — Bey — He; 4 Žun) =0. (4.47) 
f 


a 


(4) The Yukawa interactions yield the following relations among chemical 
potential of the LH and RH fermions, 


Mq, — HH — Hd; = 0 , (4.48) 
He, — HH — He; =0. (4.50) 
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From Eq. (4.44), the baryon number density ng = i g BT? and lepton num- 
ber density nz = ł gL;T?, where L; is the individual lepton flavor number 
with 7 = (e, u, T), can be expanded in terms of the chemical potentials. 
Hence 


B=) (ug + tus + Has) (4.51) 


L= So Li, Ly = 2p, + be, - (4.52) 


Consider the case where all Yukawa interactions are in equilibrium. The 
asymmetry (L;—B/Nf) is then preserved. If we further assume equilibrium 
among different generations, we, = pe and fg, = Hq, together with the 
sphaleron and hypercharge constraints, all the chemical potentials can then 
be expressed in terms of ue, 


_ 2Nf+3 — ON +1 _ 2N;s-1 
HeT ON ea Ha = 6NF F3” DETEN ite 
1 _  ANs (4.58) 
Mg = — gles MH = oN; p3) 
The corresponding B and L asymmetries are 
4 
B= =e 5 (4.54) 
14N?+9N 
f f 
= — u. 4.55 
6N; +3 He ( ) 
Thus B, L and B — L are related by: 
B=c,(B-L), L= (cs—-1)(B- L), (4.56) 
where 
SNe +4 
s = =. 4.57 
s = 22N; + 13 (sun 
For models with Ny Higgses, the parameter cs is given by, 
_ 8Nf+4Ny 
s = 32N; 4 3Ng eee) 


For T = 100 GeV ~ 10!% GeV, which is of interest of baryogenesis, 
gauge interactions are in equilibrium. Nervertheless, the Yukawa interac- 
tions are in equilibrium only in a more restricted temperature range. But 
these effects are generally small, and thus will be neglected in these lec- 
tures. These effects have been investigated recently; they will be discussed 
in Sec. 4.5. 
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4.1.4. Mechanisms for Baryogenesis and Their Problems 


There have been many mechanisms for baryogenesis proposed. Each has 
attractive and problematic aspects, which we discuss below. 


4.1.4.1. GUT Baryongenesis 


The GUT baryogenesis was the first implementation of Sakharov’s B- 
number generation idea. The B-number violation is an unavoidable con- 
sequence in grand unified models, as quarks and leptons are unified in the 
same representation of a single group. Furthermore, sufficient amount of 
CP violation can be incorporated naturally in GUT models, as there ex- 
ist many possible complex phases, in addition to those that are present in 
the SM. The relevant time scales of the decays of heavy gauge bosons or 
scalars are slow, compared to the expansion rate of the Universe at early 
epoch of the cosmic evolution. The decays of these heavy particles are thus 
inherently out-of-equilibrium. 

Even though GUT models naturally encompass all three Sakharov’s con- 
ditions, there are also challenges these models face. First of all, to generate 
sufficient baryon number asymmetry requires high reheating temperature. 
This in turn leads to dangerous production of relic particles, such as grav- 
itinos (see Sec. 4.2.3). As the relevant physics scale Mcur ~ 10'® GeV is 
far above the electroweak scale, it is also very hard to test GUT models ex- 
perimentally using colliders. The electroweak theory ensures that there are 
copious B-violating processes between the GUT scale and the electroweak 
scale. These sphelaron processes violate B+ L, but conserve B— L. There- 
fore, unless a GUT mechanism generates an excess of B — L, any baryon 
asymmetry produced will be equilibrated to zero by the sphaleron effects. 
As U(1) p_1 isa gauged subgroup of SO(10), GUT models based on SO(10) 
are especially attractive for baryogenesis. 


4.1.4.2. EW Baryogenesis 


In electroweak baryogenesis, the departure from thermal equilibrium is pro- 
vided by strong first order phase transition. The nice feature of this mech- 
anism is that it can be probed in collider experiments. On the other hand, 
the allowed parameter space is very small. It requires more CP violation 
than what is provided in the SM. Even though there are additional sources 
of CP violation in MSSM, the requirement of strong first order phase tran- 
sition translates into a stringent bound on the Higgs mass, my < 120 GeV. 


Leptogenesis 139 


To obtain a Higgs mass of this order, the stop mass needs to be smaller 
than, or of the order of, the top quark mass, which implies fine-tuning in 
the model parameters. 


4.1.4.3. Affleck-Dine Baryogensis 


The Affleck-Dine baryogenesis [20] involves cosmological evolution of scalar 
fields which carry B charges. It is most naturally implemented in SUSY 
theories. Nevertheless, this mechanism faces the same challenges as in GUT 
baryogenesis and in EW baryogenesis. 


4.1.5. Sources of CP Violation 


In the SM, C is maximally broken, since only LH electron couples to the 
SU(2)z gauge fields. Furthermore, CP is not an exact symmetry in weak 
interaction, as observed in the Kaon and B-meson systems. The charged 
current in the weak interaction basis is given by, 


g —" 
Lw = —Uzy"D_LW,, + hc. , 4.59 
Ww Wa LY VLW pu ( ) 


where Uz = (u,c,t), and Dz = (d,s,b);. Quark mass matrices can be 
diagonalized by bi-unitary transformations, 


diag(Mu, Me, me) = VÉ M” VÈ , (4.60) 
diag(ma,ms,ma) = VEM°V¢ . (4.61) 


Thus the charged current interaction in the mass eigenstates reads, 
oie 


v2 


where U} = VU; and D}, = VÉD: are the mass eigenstates, and Ucn = 
ViE(V?)t is the CKM matrix. For three families of fermions, the unitary 
matrix K can be parameterized by three angles and six phases. Out of 
these six phases, five of them can be reabsorbed by redefining the wave 
functions of the quarks. There is hence only one physical phase in the CKM 
matrix. This is the only source of CP violation in the SM. It turns out that 
this particular source is not strong enough to accommodate the observed 
matter-antimatter asymmetry. The relevant effects can be parameterized 
by [21], 


Lw Up Vey! DW, +h.c., (4.62) 


4 r3 
awT 


B~ dcp ~10-8dcp , (4.63) 
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where dcp is the suppression factor due to CP violation in the SM. Since 
CP violation vanishes when any two of the quarks with equal charge have 
degenerate masses, a naive estimate gives the effects of CP violation of the 
size 


Acp = (mz = me) (mè — my) (mu — m4) (4.64) 
(ms = ma) (ms — ma) (ma — mp) - J. 
Here the proportionality constant J is the usual Jarlskog invariant, which 
is a parameterization independent measure of CP violation in the quark 
sector. Together with the fact that Acp is of mass (thus temperature) 
dimension 12, this leads to the following value for dcp, which is a dimen- 
sionless quantity, 


Acp 


Te = 10720 ; (4.65) 


ÔCP = 
and Tc is the temperature of the electroweak phase transition. The baryon 
number asymmetry due to the phase in the CKM matrix is therefore of 
the order of B ~ 10778, which is too small to account for the observed 
B ~ 1071. 

In MSSM, there are new sources of CP violation due to the presence of 
the soft SUSY breaking sector. The superpotential of the MSSM is given 
by, 


W = pH He + h” A Âû + ht Âd +h A, Le . (4.66) 
The soft SUSY breaking sector has the following parameters: 


e tri-linear couplings: THQ + rH, Ode + T°, L + h.c., where 
Tu.de) = Aw ae) Z pide). 

e bi-linear coupling in the Higgs sector: uBH; Ho; 

e gaugino masses: M; for i = 1,2,3 (one for each gauge group); 

e soft scalar masses: mr. 


In the constrained MSSM (CMSSM) model with mSUGRA boundary con- 
ditions at the GUT scale, a universal value is assumed for the tri-linear 
) = A. Similarly, the gaugino masses and scalar 
masses are universal, M; = M, and mrs =m. Two phases may be removed 
by redefining the phase of Hy such that the phase of u is opposite to the 
phase of B. As a result, the product uB is real. Furthermore, the phase of 
M can be removed by R-symmetry transformation. This then modifies the 


coupling constants, A(¢%¢ 


tri-linear couplings by an additional factor of e~®”, while other coupling 
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constants are invariant under the R-symmetry transformation. There are 
thus two physical phases remain, 


a =Arg(AM), 6, =—Arg(B). (4.67) 


These phases are relevant in soft leptogenesis, which is discussed in 
Sec. 4.3.2. 

If the neutrinos are massive, the leptonic charged current interaction in 
the mass eigenstates of the leptons is given by, 


Ly = Uhn" L Wpa + h.c., (4.68) 
where Uys = (V¥)' V£. (For a review on physics of the massive neutrinos, 
see, e.g. Refs. [22] and [23]. See also Rabi Mohapatra’s lectures.) The 
matrices Vý and V£ diagonalize the effective neutrino mass matrix and 
the charged lepton mass matrix, respectively. If neutrinos are Majorana 
particles, which is the case if small neutrino mass is explained by the seesaw 
mechanism [24], the Majorana condition then forbids the phase redefinition 
of vg. Unlike in the CKM matrix, in this case only three of the six complex 
phases can be absorbed, and there are thus two additional physical phases 
in the lepton sector if neutrinos are Majorana fermions. And due to this 
reason, CP violation can occur in the lepton sector with only two families. 
(Recall that in the quark sector, CP violation can occur only when the 
number of famalies is at least three). The MNS matrix can be parameterized 
as a CKM-like matrix and a diagonal phase matrix, 


€12€13 §12C13 sıze"? 
UmnNs = | —512C23 — 128238136 C1223 — 8125238136? $23C13 
$12523 — €12€23813e —C€12523 — 81223813 €23C13 
1 
eto21/2 (4.69) 
et231/2 


The Dirac phase 6 affects neutrino oscillation (see Boris Kayser’s lectures), 


L 
P(va > vg) = bap -4X Re(Ua:Ug; U; U}:) sin? (an $) (4.70) 


i>j 


L 
X ep 2 
+2 Jee sin T mz) 


i>j 
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where the parameterization invariant CP violation measure, the leptonic 


Jarlskog invariant J2, is given by, 


_ Im(Ai2H23H31) 
Ami AMZ Ams 

The two Majorana phases, a2; and &31, affect neutrino double decay (see 

Petr Vogel’s lectures). Their dependence in the neutrinoless double beta 


H = (Melly MEIEN. (4.71) 


lep __ 
Je Ji 


decay matrix element is, 
|(Mee)|? = mi lUe |f + m3 |Ue2|f + m3 [Ues|” (4.72) 
+2m mo Ual |Ue2|" cos Q21 


+2mımg [Ver |? \Ueal? COS Q31 


+2m2m3 [Wes |" |Ues|" cos(a31 = Q21) $ 


The Lagrangian at high energy that describe the lepton sector of the 

SM in the presence of the right-handed neurinos, vp,, is given by, 
L= er, iy" Ouer, + Erig Oper, + Nr, iy". Nr, (4.73) 

+fijēr: l, HÝ + hijNr,é.,H E I ij Nr, NR; +h.c.. 
Without loose of generality, in the basis where fi; and Mij are diagonal, the 
Yukawa matrix hij is in general a complex matrix. For 3 families, h has nine 
phases. Out of these nine phases, three can be absorbed into wave functions 
of £,,. Therefore, there are six physical phases remain. Furthermore, a real 
hij can be diagonalized by a bi-unitary transformation, which is defined 
in terms of six mixing angles. After integrating out the heavy Majorana 
neutrinos, the effective Lagrangian that describes the neutrino sector below 

the seesaw scale is, 
Leff = 1, ty" One, + Er iY OLER, + fuEr: l H' (4.74) 
1 T H? 
ts XO hig hng lx £1; um the 
k 

This leads to an effective neutrino Majorana mass matrix whose parameters 
can be measured at the oscillation experiments. As Majorana mass matrix 
is symmetric, for three families, it has six independent complex elements 
and thus six complex phases. Out of these six phases, three of them can 
be absorbed into the wave functions of the charged leptons. Hence at low 
energy, there are only three physical phases and three mixing angles in 
the lepton sector. Going from high energy to low energy, the numbers of 
mixing angles and phases are thus reduced by half. Due to the presence 
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of the additional mixing angles and complex phases in the heavy neutrino 
sector, it is generally not possible to connect leptogenesis with low energy 
CP violation. However, in some specific models, such connection can be 
established. This will be discussed in more details in Sec. 4.4. 


4.2. Standard Leptogenesis 


4.2.1. Standard Leptogenesis (Majorana Neutrinos) 


As mentioned in the previous section, baryon number violation arises nat- 
urally in many grand unified theories. In the GUT baryogenesis, the asym- 
metry is generated through the decays of heavy gauge bosons (denoted by 
“V” in the following) or leptoquarks (denoted by “S” in the following), 
which are particles that carry both B and L numbers. In GUTs based on 
SU(5), the heavy gauge bosons or heavy leptoquarks have the following 
B-non-conserving decays, 


V = fu%, B=-1/3, B-L=2/3 (4.75) 
V > qLdẸ, B=2/3, B-L=2/3 (4.76) 
St,gq,, B=-1/3, B-L=2/3 (4.77) 
S > qraz, B=2/3, B-L=2/3. (4.78) 


Since (B — L) is conserved, i.e. the heavy particles V and S both carry 
(B — L) charges 2/3, no (B — L) can be generated dynamically. In addition, 
due to the sphaleron processes, (B) = (B — L} = 0. In SO(10), (B — L) 
is spontaneously broken, as it is a gauged subgroup of SO(10). Heavy 
particles X with Mx < Mp-z can then generate a (B — L) asymmetry 
through their decays. Nevertheless, for My ~ Meur ~ 10!° GeV, the 
CP asymmetry is highly suppressed. Furthermore, one also has to worry 
about the large reheating temperature Try ~ Maur after the inflation, 
the realization of thermal equilibrium, and in supersymmetric case, the 
gravitino problem. These difficulties in GUT baryogenesis had led to a lot 
of interests in EW baryogenesis, which also has its own disadvantages as 
discussed in Sec. 4.1.4. 

The recent advent of the evidence of neutrino masses from various neu- 
trino oscillation experiments opens up a new possibility of generating the 
asymmetry through the decay of the heavy neutrinos [25]. A particular 
attractive framework in which small neutrino masses can naturally arise is 
GUT based on SO(10) (for a review, see, i.e. Ref. [22]). SO(10) GUT 
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models accommodate the existence of RH neutrinos, 


Y(16) = (qL, UR, eR, dR, lx, VR)» (4.79) 
which is unified along with the fifteen known fermions of each family into 
a single 16-dimensional spinor representation. For hierarchical fermion 
masses, one easily has 


My & Mpg- ~ Maur , (4.80) 


where N = vr + vR is a Majorana fermion. The decays of the right-handed 
neutrino, 


NH, N>tH, (4.81) 


where H is the SU (2) Higgs doublet, can lead to a lepton number asymme- 
try. After the sphaleron processes, the lepton number asymmetry is then 
converted into a baryon number asymmetry. 

The most general Lagrangian involving charged leptons and neutrinos 
is given by, 


1 
Ly = fijER: EL; Hi + hijVr,ei,H = z MR)JuTR YR; +h.c.. (4.82) 


As the RH neutrinos are singlets under the SM gauge group, Majorana 
masses for the RH neutrinos are allowed by the gauge invariance. Upon 
the electroweak symmetry breaking, the SM Higgs doublet gets a VEV, 
(H) = v, and the charged leptons and the neutrino Dirac masses, which 
are much smaller than the RH neutrino Majorana masses, are generated, 


me = fv, mp=hv< Mr. (4.83) 


The neutrino sector is therefore described by a 2 x 2 seesaw matrix as, 


e a) (4.84) 


Diagonalizing this 2 x 2 seesaw matrix, the light and heavy neutrino mass 
eigenstates are obtained as, 


v2Vivul+Vive, Nevaetve (4.85) 
with corresponding masses 
1 
my ~ -VI mE —mpV,, mn ~ Me. (4.86) 
Mr 


Here the unitary matrix V, is the diagonalization matrix of the neutrino 
Dirac matrix. 
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At temperature T < Mp, RH neutrinos can generate a lepton number 
asymmetry by means of out-of-equilibrium decays. The sphaleron processes 
then convert AL into AB. 


4.21.1. The Asymmetry 


At the tree level, the ith RH neutrino decays into the Higgs doublet and 
the charged lepton doublet of a flavor, N; > H + la, where a = (e, u, T). 
The total width of this decay is, 


Pp, = SON > H + la) +T(Ni > H + 2a) (4.87) 


$ 


a 


1 
—(hh');;M, 
= ) 


II 


Suppose that the lepton number violating interactions of the lightest right- 
handed neutrino, Ni, wash out any lepton number asymmetry generated in 
the decay of N23 at temperatures T >> Mj. (For effects due to the decays 
of N23, see Ref. [26].) In this case with N, decay dominating, the final 
asymmetry only depends on the dynamics of N,. The out-of-equilibrium 
condition requires that the total width for Nı decay, 'p,, to be smaller 
compared to the expansion rate of the Universe at temperature T = Mj, 


Tp, <H l (4.88) 
T=M, 


That is, the heavy neutrinos are not able to follow the rapid change of the 
equilibrium particle distribution, once the temperature dropped below the 
mass Mı. Eventually, heavy neutrinos will decay, and a lepton asymmetry 
is generated due to the CP asymmetry that arises through the interference 
of the tree level and one-loop diagrams, as shown in Fig. 4.7, 


Pa [T(N > lof) -T(N 
aM > laH) +r(N 


= ataa ort [eCa +o Ce) 


In Fig. 4.7, the diagram (b) is the one-lop vertex correction, which gives 
the term, f(x), in Eq. 4.89 after carrying out the loop integration, 


fe) = ve[1-(+2)n(**)| (4.90) 


(4.89) 


la 
lH 
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Fig. 4.7. Diagrams in SM with RH neutrinos that contribute to the lepton number 
asymmetry through the decays of the RH neutrinos. The asymmetry is generated due 
to the interference of the tree-level diagram (a) and the one-loop vertex correction (b) 
and self-energy (c) diagrams. 


Diagram (c) is the one-loop self-energy. For |M; — M;| > |r; —T1|, the 
self-energy diagram gives the term 


W (4.91) 


7 


M >a. 


in Eq. 4.89. For hierarchical RH neutrino masses, Mı « Mə, Ms, the 
asymmetry is then given by, 


3 1 
->=—— X) Imf (hhh) ti} 
81 (hhb) me i 


Mı 


Ta (4.92) 


= 


Note that when Nx and N; in the self-energy diagram (c) have near degen- 
erate masses, there can be resonant enhancement in the contributions from 
the self-energy diagram to the asymmetry. Such resonant effect can allow 
M, to be much lower while still generating sufficient amount of the lepton 
number asymmetry. This will be discussed in Sec. 4.3.1. 

To prevent the generated asymmetry given in Eq. 4.89 from being 
washed out by the inverse decay and scattering processes, the decay of the 
RH neutrinos has to be out-of-equilibrium. In other words, the condition 

pe Ne og (4.93) 
H\r=m, (1.7)(327)./g Mı 
has to be satisfied. This leads to the following constraint on the effective 
light neutrino mass 


2 


mM = (h,hi Doni —— 
im = ( uaF Vox 


<10%eV, (4.94) 
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where gẹ, is the number of relativistic degrees of freedom. For SM, gx ~ 

106.75, while for MSSM, gą ~ 228.75. The wash-out effect is parameterized 

by the coefficient «, and the final amount of lepton asymmetry is given by, 
y= Pee gS (4.95) 

s gx 

where « parameterizes the amount of wash-out due to the inverse decays 

and scattering processes. The amount of wash-out depends on the size of 

the parameter r: 


(1) If r < 1 for decay temperature Tp < Mx, the inverse decay and 2-2 


scattering are impotent. In this case, the inverse decay width is given 
by, 


3/2 
TID", (=) eoMxIT op (4.96) 


I's ~va) Pi (4.97) 


Thus the inverse decays and scattering processes can be safely ig- 
nored, and the asymmetry AB produced by decays is not destroyed 
by the asymmetry —AB produced in inverse decays and scatterings. 
At T ~ Tp, the number density of the heavy particles X has thermal 
distribution, nx ~ nx œ ny. Thus the net baryon neumber density 
produced by out-of-equilibrium decays is 


nL =& ny YQ ny. (4.98) 


(2) For r > 1, the abundance of X and X follows the equilibrium values, 
and there is no departure from thermal equilibrium. As a result, no 
lepton number may evolve, and the net lepton asymmetry vanishes, 


Ne — nyg 


dt 


+ 3H (n — nz) = Ay? = 0. (4.99) 


In general, for 1 < r < 10, there could still be sizable asymmetry. The 
wash out effects due to inverse decay and lepton number violating scattering 
processes together with the time evolution of the system is then accounted 
for by the factor x, which is obtained by solving the Boltzmann equations 
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N 
we 
(b) 
Fig. 4.8. Decay and inverse decay processes in the thermal bath. 

for the system (see next section). An approximation is given by [19], 
10° Ş r : g = (0.1r)2e750-D* (< 1077) (4.100) 
10r <10: k= i= (1072 ~ 1077) (4.101) 
0Sr[10:k= NEE (10-1 ~ 107?) . (4.102) 

The EW sphaleron effects then convert Yz into Yp, 
Yp = ea = Ei Y; (4.103) 
8 c—1 


where c is the conversion factor derived in Sec. 4.1.3. 


4.2.1.2. Boltzmann Equations 


As the decays of RH neutrinos are out-of-equilibrium processes, they are 
generally treated by Boltzmann equations. Main processes in the thermal 
bath that are relevant for leptogenesis include, 


N t N 


1 q t ! q 
(a) (b) 


Fig. 4.9. The AL = 1 scattering processes in the thermal bath. 


(1) decay of N (Fig. 4.8 (a)): 
N—£+H, N-0+H (4.104) 
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(a) (b) (c) 
Fig. 4.10. The AL = 2 scattering processes in the thermal bath. 
(2) inverse decay of N (Fig. 4.8 (b)): 
L+H >N, +H —>N (4.105) 


(3) 2-2 scattering: These include the following AL = 1 scattering processes 
(Fig. 4.9), 


[s-channel] : Nlotg , NMtetq (4.106) 

[t-channel] : Nitol¢d , Mtolg (4.107) 
and AL = 2 scattering processes (Fig. 4.10), 

LHH , UHH, HHH. (4.108) 


Basically, at temperatures T > Mı, these AL = 1 and AL = 2 processes 
have to be strong enough to keep Nj, in equilibrium. Yet at temperature 
T < Mı, these processes have to be weak enough to allow N; to generate 
an asymmetry. 

The Boltzmann equations that govern the evolutions of the RH neutrino 
number density and B — L number density are given by [27], 


dNy, 


a = (D+ 8)(Nw, — NM) (4.109) 
dNg-L 2 
a = ~aD(Nw, — NM) — WNB- (4.110) 
where 
Tp, Is, I M 
(D, S,W) = Wateatwi gM (4.111) 


Hz , T~, 
Here Ip includes both decay and inverse decay, I's includes AL = 1 
scattering processes and I'w includes inverse decay and AL = 1, AL = 2 
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scattering processes. The N abundance is affected by the decay, inverse 
decay and the AL = 1 scattering processes. It is manifest in Eq. 4.110 
that the Nı decay is the source for (B — L), while the inverse decay and 
the AL = 1, 2 scattering processes wash out the asymmetry. The generic 
behavior of the solutions to the Boltzmann equations is shown in Fig. 4.11. 


4.2.1.3. Bounds on Neutrino Masses 


In the case with strongly hierarchical right-handed neutrino masses, when 
the asymmetry €, due to the decay of the lightest right-handed neutrino, 
N,, contribute dominantly to the total asymmetry, leptogenesis becomes 
very predictive [1, 4, 27], provided that N, decays at temperature T > 10 
GeV. In particular, various bounds on the neutrino masses can be obtained. 

For strongly hierarchyical masses, Mı /Mə2 « 1, there is an upper bound 


Mm! 1/7 


Fig. 4.11. Generic behavior of the solutions to Boltzmann equations. Here the functions 
Ny, (red solid curve) and Ng_y (green solid curve) are solutions to Eq. 4.109 and 4.110. 
The function (Ny, )°% (blue dotted curve) is the equilibrium particle distribution. 
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on €, [29], called the “Davidson-Ibarra” bound, 
a ee ee 4.112 
lal < > Pl, (4.112) 


which is obtained by expanding € to leading order in M,/M>. Becuase 
[ma — mə| < V/Amés, ~ 0.05 eV, a lower bound on M; then follows, 


M; > 2 x 10° GeV. (4.113) 


This bound in turn implies a lower bound on the reheating temperature, 
Tru, and is in conflict with the upper bound from gravitino over production 
constraints if supersymmetry is incorporated. We will come back to this 
in Sec. 4.2.3. One should note that, in the presence of degenerate light 
neutrinos, the leading terms in an expansion of €, in Mı/Mə and Mı /M3 
vanish. However, the next to leading order terms do not vanish and in this 
case one has [30], 


pr _M3 
€1| £ Max| e>, . 4.114 
als naxe 7) (4114) 

By requiring that there is no substantial washout effects, bounds on 
light neutrino masses can be derived. To have significant amount of baryon 
asymmetry, the effective mass m, defined in Eq. 4.94 cannot be too large. 
Generally Mı < 0.1 — 0.2 is required. As the mass of the lightest active 
neutrino mı S Mı, an upper bound on mı thus ensues. By further requir- 
ing the AL = 2 washout effects be consistent with successful leptogenesis 
impose a bound on, 


\/ (m2 + m3 + m3) S (0.1 — 0.2) eV, (4.115) 


which is of the same order as the bound on Mı. From these bounds, the 
absolute mass scale of neutrino masses is thus known up to a factor of 
~ 3 to be in the range, 0.05 S ma < 0.15 eV [4], if the observed baryonic 
asymmetry indeed originates from leptogenesis through the decay of N1. 


4.2.2. Dirac Leptogenesis 


In the standard leptogenesis discussed in the previous section, neutrinos 
acquire their masses through the seesaw mechanism. The decays of the 
heavy right-handed neutrinos produce a non-zero lepton number asymme- 
try, AL # 0. The electroweak sphaleron effects then convert AL partially 
into AB. This standard scenario relies crucially on the violation of lepton 
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number, which is due to the presence of the heavy Majorana masses for the 
right-handed neutrinos. 

It was pointed out [31] that leptogenesis can be implemented even in the 
case when neutrinos are Dirac fermions which acquire small masses through 
highly suppressed Yukawa couplings without violating lepton number. The 
realization of this depends critically on the following three characteristics 
of the sphaleron effects: (i) only the left-handed particles couple to the 
sphalerons; (i) the sphalerons change (B + L) but not (B — L); (iii) the 
sphaleron effects are in equilibrium for T > Try. 

As the sphelarons couple only to the left-handed fermions, one may 
speculate that as long as the lepton number stored in the right-handed 
fermions can survive below the electroweak phase transition, a net lepton 
number may be generated even with L = 0 initially. The Yukawa couplings 
of the SM quarks and leptons to the Higgs boson lead to rapid left-right 
equilibration so that as the sphaleron effects deplete the left-handed (B+L), 
the right-handed (B + L) is converted to fill the void and therefore it is 
also depleted. So with B = L = 0 initially, no baryon asymmetry can 
be generated for the SM quarks and leptons. For the neutrinos, on the 
other hand, the left-right equilibration can occur at a much longer time 
scale compared to the electroweak epoch when the sphaleron washout is in 
effect. The left-right conversion for the neutrinos involves the Dirac Yukawa 
couplings, \0;Hvp, where A is the Yukawa coupling constant, and the rate 
for these conversion processes scales as, 


Ter ~AT. (4.116) 


For the left-right conversion not to be in equilibrium at temperatures above 
some critical temperature Teq, requires that 


Tir SH, for T >T, (4.117) 


where the Hubble constant scales as, 


(4.118) 


Hence the left-right equilibration can occur at a much later time, T < 
Teq < Tew, provided, 
Teg Tew 


K ; 
Mp, Mp, 


(4.119) 
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Fig. 4.12. With sufficiently small Yukawa couplings, the left-right equilibration occurs 
at a much later time, well below the electroweak phase transition temperature. It is 
therefore possible to generate a non-zero baryon number even if B = L = 0 initially. For 
the SM particles, as shown in the insert for comparison, the left-right equilibration takes 
place completely before or during the sphaleron processes. Thus no net baryon number 
can be generated if B — L = 0 initially. Figure taken from Ref [31]. 


With Mp, ~ 101° GeV and Tgw ~ 10? GeV, this condition then translates 
into 


A < 1-8) (4.120) 


Thus for neutrino Dirac masses mp < 10 keV, which is consistent with all 
experimental observations, the left-right equilibration does not occur until 
the temperature of the Universe drops to much below the temperature of 
the electroweak phase transition, and the lepton number stored in the right- 
handed neutrinos can then survive the wash-out due to the sphalerons [31]. 

Once we accept this, the Dirac leptogenesis then works as follows. Sup- 
pose that some processes initially produce a negative lepton number (AL z), 
which is stored in the left-handed neutrinos, and a positive lepton number 
(ALR), which is stored in the right-handed neutrinos. Because sphalerons 
only couple to the left-handed particles, part of the negative lepton number 
stored in left-handed neutrinos get converted into a positive baryon number 
by the electroweak anomaly. This negative lepton number AL, with re- 
duced magnitude eventually equilibrates with the positive lepton number, 
ALR when the temperature of the Universe drops to T < Tew. Because 
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the equilibrating processes conserve both the baryon number B and the 
lepton number L separately, they result in a Universe with a total positive 
baryon number and a total positive lepton number. And hence a net baryon 
number can be generated even with B = L = 0 initially. 

Such small neutrino Dirac Yukawa couplings required to implement 
Dirac leptogensis are realized in a SUSY model proposed in Ref. [32]. 


4.2.3. Gravitino Problem 


For leptogenesis to be effective, as shown in Sec. 4.2.1.3, the mass of the 
lightest RH neutrino has to be Mı > 2 x 10° GeV. Figure 4.13 shows the 
lower bound on the lightest RH neutrino mass as a function of the low 
energy effective lightest neutrino mass, Mı [28, 33]. If RH neutrinos are 
produced thermally, the reheating temperature has to be greater than the 
right-handed neutrino mass, Try > Mr. This thus implies that Try > 2x 
10° GeV, in order to generate sufficient baryon number asymmetry. Such a 
high reheating temperature is problematic as it could lead to overproduction 
of light states, such as gravitinos [34, 35]. If gravitinos are stable (i.e. LSP), 
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Fig. 4.13. Lower bound on the lightest RH neutrino mass, Mı (circles) and the initial 
temperature, T; (dotted line), for mı = 0 and ng” = 6 x 10-19. The red circles (solid 
lines) denote the analytical (numerical) results. The vertical dashed lines indicate the 
range (\/Am2,, \/Am?2,,,). Figure taken from Ref. [28]. 
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Table 4.3. Photo-dissociation reactions that the high en- 
ergy photons can participate in. The light elements may be 
destroyed through these reactions and thus their abundance 
may be changed. 


reaction threshold (MeV) 
D+y7>n+p 2.225 
T+y>n+D 6.257 
T+y>p+n++n 8.482 
3He+y>p+D 5.494 
4He+y—>p+T 19.815 
4He+y—> n+? He 20.578 
4He+y—>pt+n+D 26.072 


WMAP constraint on DM leads to stringent bound on gluino mass for any 
given gravitino mass m3/2 and reheating temperature Try. (Bounds on 
other gaugino masses can also be obtained as discussed in [36].) On the 
other hand, if gravitinos are unstable, it has long lifetime and can decay 
during and after the BBN, and may have the following three effects on 
BBN [1]: 


(1) These decays can speed up cosmic expansion, and increase the neutron 
to proton ratio and thus the He abundance; 

(2) Radiation decay of gravitinos, Y — y +7, increases the photon density 
and thus reduces the ng/n, ratio; 

(3) High energy photons emitted in gravitino decays can destroy light el- 
ements (D, T, ?He, “He) through photo-dissociation reactions such as 
those given in Table 4.3; 


The gravitino number density, n3/2, during the thermalization stage 
after the inflation is governed by the following Boltzmann equation [35], 


d 
G32 + 3Hns/2 = (x v) “He (4.121) 


tot 


where >>, ~ 1/M3?, is the total cross section determining the production 
rate of gravitinos and Nien ~ T? is the number density of light particles 
in the thermal bath. As the thermalization is very fast, the friction term 
3H7n3/2 in the above Boltzmann equation can be neglected. Using the fact 
that the Universe is radiation dominant, H ~ t! ~ T?/Mp, it follows 
that, 

T4 
Mp ? 


(4.122) 


3/2 ~ 


156 M.-C. Chen 


1OB (7 a7 SCC) 
1010 E 1010 
10° 10° 
108 108 
> 10? 107 
O 

p 108 108 
= 105 105 
104 B(y,>gč)=1 104 

n=(6.140.3)x10-19 
a E e= Mg/2/2 1o: 
104 Loti uul a bea uul (Ea E | Luul 102 

10? 103 104 108 


M572 (GeV) 


Fig. 4.14. Upper bound on reheating temperature as a function of the gravitino mass, 
for the case when gravitino dominant decays into a gluon-gluino pair. Figure taken from 
Ref. [37]. 


and the number density at thermalization in unit of entropy then reads, 


3/2 _oTRH 
— ~ 10 : 4.123 
s Mp ( ) 


The observed abundances for various light elements are, 


0.22 < Y, = (page /PB)p < 0.24, (4.124) 
(np/ny) > 1.8 x 107°, (4.125) 


(=) < 1074. (4.126) 
nH p 
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Table 4.4. Upper bound on the reheating temperature for differ- 
ent values of gravitino mass. 


Gravitino mass m3/2 Upper bound on TR 
<100 GeV 106-77 GeV 

100 GeV —1 TeV 107-9 GeV 
1 TeV —3 TeV 109712 GeV 
3 TeV —10 TeV 1012 GeV 


The most stringent constraint is from the abundance of (D + *He) which 
requires the gravitino number density to be 


P9/2 1g -2 TRH < 49-12, (4.127) 
S Pl 
The constraint Try < 10879 GeV then follows. More recently, it has 
been shown that, for hadronic decay modes, ~ — g +g, the bounds are 
even more stringent, Tr < 108677 GeV, for gravitino mass m3 /2 ~ 100 
GeV [37]. Fig. 4.14 shows the upper bound on the reheating temperature, 
Tr, for different values of gravitino mass, m3/2. Table 4.4 summarizes the 
numerical results for the upper bound on Tp for various values of m3 2. It 
has also been pointed out [38] that including recent constraint from ° Li, a 
new upper bound Tg < 107 GeV can be derived for the case of gravitino LSP 
in the constrained minimal supersymmetric Standard Model (CMSSM). 
There is therefore a conflict between generation of sufficient amount of 
leptogenesis and not overly producing gravitinos. To avoid these conflicts, 
various non-standard scenarios for leptogenesis have been proposed. These 
are discussed in the next section. 


4.3. Non-standard Scenarios 


There are a few non-standard scenarios proposed to evade the gravitino 
over-production problem. In these new scenarios, the conflicts between 
leptogenesis and gravitino over-production problem are overcome by, (i) 
resonant enhancement in the self-energy diagrams due to near degenerate 
right-handed neutrino masses (resonant leptogenesis); (ii) relaxing the rela- 
tion between the lepton number asymmetry and the right-handed neutrino 
mass (soft leptogenesis); (iii) relaxing the relation between the reheating 
temperature and the right-handed neutrino mass (non-thermal leptogene- 
sis). These scenarios are discussed below. 
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4.3.1. Resonant Leptogenesis 


Recall that in the standard leptogenesis discussed in Sec. 4.2, contributions 
to the CP asymmetry is due to the interference between the tree-level and 
the one-loop diagrams, that include the vertex correction and self-energy 
diagrams. It was pointed out in Ref. [39] that in the limit My, — My, « 
My,, the self-energy diagrams dominate, 


Self Im[(hv ht ig}? (M? = Mj)MTn, 
Me ah ay Me Ae (4.128) 
(hLAb) (Ry hb)s; LG — Mz)? + MTh, 

When the lightest two RH neutrinos have near degenerate masses, M? — 
Mz ~ Th, the asymmetry can be enhanced. To be more specific, CP 
asymmetry of O(1) is possible, when 

I m(hyhh) to 
(huhi) (hy hb)22 
Due to this resonant effect, the bound on the RH neutrino mass scale from 
the requirement of generating sufficient lepton number asymmetry can be 


significantly lower. It has been shown that sufficient baryogenesis can be 
obtained even with M1,2 ~ TeV [40]. 


1 
Mı — M2 ~ lN, , assuming 


~l. 4.12 
> (4.129) 


4.3.2. Soft Leptogenesis 


CP violation in leptogenesis can arise in two ways: it can arise in decays, 
which is the case in standard leptogenesis described in the previous section. 
It can also arise in mixing. An example of this is the soft leptogenesis. 
Recall that in the Kaon system, non-vanishing CP violation exists due to 
the mismatch between CP eigenstates and mass eigenstates (for a review, 
see for example, Ref. [41]). The CP eigenstates of the K° system are 
Z (|K°) + |K’)). The time evolution of the (K°, K’) system is described 
by the following Schrodinger equation, 


E (z) -H (z) (4.130) 


where the Hamiltanian 1 is given by H = M — ŁA. Here, the off-diagonal 
matrix element Mj 2 describes the dispersive part of the transition ampli- 
tude, while the element Aj2 gives the absorptive part of the amplitude. 
The physical (mass) eigenstates, |K L,s), are given in terms of the flavor 
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eigenstates, |K°) and |’), as 
|Kr) = p|K°) +q|K) (4.131) 
|Ks) =p|K°) — q| K°) . (4.132) 


To have non-vanishing CP violation requires that there exists a mismatch 
between the CP eigenstates and the physical eigenstates. This in turn 


implies, 
2 : 
q q 2Mï2 — =i) 
= 1, where 2] = | =]. 4.133 
R g (4) Corer ee) 


For soft leptogenesis, the relevant soft SUSY Lagrangian that involves 
lightest RH sneutrinos vp, is the following, 


1 at A ee 
—Lsoft = (BMi?m Pn, + AV Livre, Hu + he.) 


+m dh DR, - (4.134) 


This soft SUSY Lagrangian and the superpotential that involves the lightest 
RH neutrino, N, 


W = MiNi Nı + VuiliNi Au (4.135) 
give rise to the following interactions 
-L4 = Tp, (MYGG Hi + Vitali, + VGH.) the. , (4.136) 


and mass terms (to leading order in soft SUSY breaking terms), 


eck, ox ae 
-Lm = (MTh Dp, + zBM:PR PR) + h.c.. (4.137) 


Diagonalization of the mass matrix M for the two states Vr, and Th, leads 
to eigenstates Ny and N_ with masses, 


|B| 
M+ ~ Mı| 1 4.138 
sam(is L), (4.138) 
where the leading order term M; is the F-term contribution from the su- 
perpotential (RH neutrino mass term) and the mass difference between the 
two mass eigenstates N, and N_ is induced by the SUSY breaking B term. 
The time evolution of the Drv, system is governed by the Schrodinger 


equation, 
aren ee =H ~ ; (4.139) 
dt Dt. Dh. 
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where the Hamiltonian is H = M — $A with M and A being [42, 43], 


Ai BE 

M=| } i Mı, (4.140) 
2Mı 
y AL 

A=| 4 J|. (4.141) 
í 1 


For the decay of the lightest RH sneutrino, Ùr, , the total decay width T4 is 
given by, in the basis where both the charged lepton mass matrix and the 
RH neutrino mass matrix are E 


ry Lowi )uMı. (4.142) 


The eigenstates of the Hamiltonian H are N, = pN + qNt, where |p|? + 
|q|/? = 1. The ratio q/p is given in terms of M and T; as, 


2 
q 2Mi, — i Aïo 2r A 
=) = = xl] +I 4.143 
(4) 2M 12 — iA12 TA BM, )’ ( ) 


in the limit A2 K Mj. Similar to the K? — i system, the source of 
CP violation in the lepton number asymmetry considered here is due to 
the CP violation in the mixing which occurs when the two neutral mass 
eigenstates (N, N_), are different from the interaction eigenstates, (Ni, 
N2). Therefore CP violation in mixing is present as long as the quantity 


\q/p| £1, which requires 
im( a ) #0. (4.144) 


For this to occur, SUSY breaking, i.e. non-vanishing A and B, is required. 
As the relative phase between the parameters A and B can be rotated away 
by an U(1) r-rotation as discussed in Sec. 4.1.5, without loss of generality 
we assume from now on that the remaining physical phase is solely coming 


from the tri-linear coupling, A. 

The total lepton number asymmetry integrated over time, €, is defined 
as the ratio of the difference to the sum of the decay widths T for yz, and 
Dh, into final states of the slepton doublet L and the Higgs doublet H, or 


the lepton doublet L and the Higgsino H or their conjugates, 


Esh EER, Ph, > A) -TOR Fh, > Fl 


sO E E a a; (4.145) 
Brol T( (Čr Ph, = f) +T (Vr, ,¥R, > f) 
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Here the final states f = (L H), (L H) have lepton number +1, and f 


denotes their conjugate, (Lt Ht), (L H), which have lepton number —1. 
After carrying out the time integration, the total CP asymmetry is [42, 43], 


40;B \Im(A) 
Ff ieee ee 2 4.14 
. (ie) Mee een 


where the additional factor ôg—p takes into account the thermal effects 
due to the difference between the occupation numbers of bosons and 
fermions [44]. 

The final result for the baryon asymmetry is [42, 43], 


= ~ —Cs dor EK, 
~ —1.48 x 1078eK, 
Im(A 
~ —(1.48 x 0-9 (St) Rosh, (4.147) 
1 


where dj, in the first line is the density of the lightest sneutrino in equi- 
librium in units of entropy density, and is given by, dy, = 45¢(3)/(a*g«); 
the factor cs, which characterizes the amount of B — L asymmetry being 
converted into the baryon asymmetry Ypg, is defined in Eq. 4.57. The pa- 
rameter « is the efficiency factor given in Sec. 4.2.1.2. The resonance factor 
R is defined as the following ratio, 


41, B 
R==—]> > (4.148) 
T? + 4B? 


which gives a value equal to one when the resonance condition, rı = 2|Bl, 
is satisfied, leading to maximal CP asymmetry. As I; is of the order 
of O(0.1 — 1) GeV, to satisfy the resonance condition, a small value for 
B «& M is thus needed. Such a small value of B can be generated by some 
dynamical relaxation mechanisms [45] in which B vanishes in the leading 
order. A small value of B ~ m?/M, is then generated by an operator 
f@OZZ1N? /M;, in the Kahler potential, where Z is the SUSY breaking 
spurion field, Z = 6% mM, [43]. In a specific SO(10) model constructed 
in Refs. [46, 47], it has been shown that with the parameter B’ = VBM: 
having the size of the natural SUSY breaking scale Vm? ~ O(1) TeV, a 
small value for B required by the resonance condition B ~ rı ~ O(0.1) 
GeV can be obtained. 


162 M.-C. Chen 


4.3.3. Non-thermal Leptogenesis 


The conflict between generating sufficient leptogenesis and not overly pro- 
ducing gravitinos in thermal leptogenesis arises due to strong dependence 
of the reheating temperature Tr on the lightest RH mass, Mp,, in thermal 
leptogenesis. This problem may be avoided if the relation between the re- 
heating temperature and the lightest RH neutrino mass is loosened. This is 
the case if the primordial RH neutrinos are produced non-thermally. One 
possible way to have non-thermal leptogenesis is to generate the primordial 
right-handed neutrinos through the inflaton decay [48]. 

Inflation solves the horizon and flatness problem, and it accounts for the 
origin of density fluctuations. Assume that the inflaton decays dominantly 
into a pair of lightest RH neutrinos, 6 — Nı + N1. For this decay to occur, 
the inflaton mass mg has to be greater than 2M,. For simplicity, let us 
also assume that the decay modes into N23 are energetically forbidden. 
The produced N; in inflaton decay then subsequently decays into H + £z 
and Ht + ti. The out-of-equilibrium condition is automatically satisfied, 
if Tr < Mı. The CP asymmetry is generated by the interference of tree 
level and one-loop diagrams, 


3 Mı 
€ = —— ——sM3bef pf , 4.149 
Ba (Hye ess (4.149) 
where deff is given in terms of the neutrino Yukawa matrix elements and 
light neutrino masses as, 


Imd hi + Ban + Bar} 


deff = (4.150) 


[ms + [hat + [ha 


Numerically, the asymmetry is given by [48], 


M m 
A Oo are = pail aes : 
cere wae (a FE Jis f ie!) 


The chain decays ® — Nı + N; and Ni —> H +£; or Ht +É reheat the 
Universe producing not only the lepton number asymmetry but also the 
entropy for the thermal bath. Taking such effects into account, the ratio of 
lepton number to entropy density after the reheating [48] is then, 


NL 3 TR —10 TR M, m3 
Os 5 E S T R eL 4.152 
z me r ESN eN A me Aana) e Ae 
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assuming deff = 1. The ratio ng/s ~ 107!° can thus be obtained with 
Mı < Me, and Tr < 10° GeV. 


4.4. Connection between leptogenesis and neutrino 
oscillation 


As mentioned in Sec. 4.1.5, there is generally no connection between low 
energy CP violating processes, such as CP violation in neutrino oscillation 
and in neutrinoless double beta decay, and leptogenesis, which occurs at 
very high energy scale. This is due to the extra phases and mixing angles 
present in the heavy neutrino sector. One way to establish such connection 
is by reducing the inter-family couplings (equivalently, by imposing texture 
zero in the Yukawa matrix). This is the case for the 3 x 2 seesaw model. A 
more powerful way to obtain such connection is to have all CP violation, 
both low energy and high energy, come from the same origin. This ensues if 
CP violation occurs spontaneously. Below we described these two models 
in which such connection does exist. 


4.4.1. Models with Two Right-Handed Neutrinos 


One type of models where there exists connection between CP violating 
processes at high and low energies is models with only two RH neutrinos. In 
this case, the neutrino Dirac mass matrix is a 3x2 matrix. This 3x2 Yukawa 
matrix has six complex parameters, and hence six phases, out of which, 
three can be absorbed by the wave functions of the three charged leptons. 
Even though, the reduction in the number of right-handed neutrinos reduces 
the number of CP phases in high energy, it also reduces the number of CP 
phases at low energy to two. There is therefor still one high energy phase 
that cannot be determined by measuring the low energy phases. However, 
if one further assumes that the 3 Yukawa matrix has two zeros, there is 
then only one CP phase in the Yukawa matrix, making the existence of the 
connection possible. 

The existence of two right-handed neutrinos is required by the cancel- 
lation of Witten anomaly, if a global leptonic SU(2) family symmetry is 
imposed [49]. (For implications of non-anomalous gauge symmetry for neu- 
trino masses, see Ref. [50]. This model provided the interesting possibility 
of probing the neutrino sector at the colliders through their couplings to the 
Z’ gauge boson [51].) Along this line, Frampton, Glashow and Yanagida 
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proposed a model, which has the following Lagrangian [52], 


ey 
lb M, 0 Ni a a’ 0 
Ea a) (ng) tO) (9 b J es 
3 


(4.153) 
with the Yukawa matrix having two zeros in the Nı — é3 and Nə — 4 
couplings. The effective neutrino mass matrix due to this Lagrangian is 
obtained, using the see-saw formula, 


a aa! 0 


M; 3 Mı 4 
aa a b bb’ 
Mı Mı F Mə M: $ (4.154) 
0 bb’ p? 
Mz Mz 


where a,b, b' are real and a! = |a'|et?. By takinging all of them to be real, 
with the choice a’ = 2a and b = b', and assuming a?/M, < b?/Mz, the 
effective neutrino masses and mixing matrix are obtained 


2a? 2b? 


Mp, = 0, Mis = A Mois = M (4.155) 
1/vV2 1/72 0 1 0 0 
U= | -1/2 1/2 1/V2}x {0 cosð sin |, (4.156) 


1/2 —1/2 1//2 0 —sin@ cos0 


where 0 ~ my, /V2m,;, and the observed bi-large mixing angles and Am?,,, 


and Am?2, can be accommodated. An interesting feature of this model is 
that the sign of the baryon number asymmetry (B « Eg = Y?a?b? sin 26) 
is related to the sign of the CP violation in neutrino oscillation (osc) in 
the following way 

atb’ 


= (2ye —B 4.157 
MM3 | + Ep x ( ) 


Eose = 


assuming the baryon number asymmetry is resulting from leptogenesis due 
to the decay of the lighter one of the two heavy neutrinos, N;. This idea 
can be realized in a SO(10) with additional singlets [53]. 


4.4.2. Models with Spontaneous CP Violation (€ Triplet 
Leptogenesis) 


The second type of models in which relation between leptogenesis and low 
energy CP violation exists is the minimal left-right symmetric model with 
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spontaneous CP violation (SCPV) [54]. The left-right (LR) model [55] is 
based on the gauge group, SU(3).x SU (2) x SU (2)rXU(1)B -L xP, where 
the parity P acts on the two SU(2)’s. (See also Kaladi Babu’s lectures.) 
In this model, the electric charge Q can be understood as the sum of the 
two T? quantum numbers of the SU(2) gauge groups, 


1 
Q = Tzr + Tzr t 3(B - £) s (4.158) 


The minimal LR model has the following particle content: In the fermion 
sector, the iso-singlet quarks form a doublet under SU(2) pr, and similarly 


for er and vp, 
u 


Q= (4) M020 Qr= (7) ~ 01218 


Dib = (5) ~ (1/2,0,—1), Lir = a ~ (0,1/2, —1). 
EERE YS aR 


In the scalar sector, there is a bi-doublet and one triplet for each of the 
SU (2)’s, 
ON + 
o=( 1 , ) ~ 0/2 1/2, 0) 
O(S aj 
a=( Ao BAT 2 ~ (1, 0, 2) 
Agva Apt 
Under the parity P, these fields transform as, 
V= Ur, ApoAr, Dedi. (4.159) 
The VEV of the SU(2)r breaks the left-right symmetry down to the SM 
gauge group, 
SU(3)c x SU(2)r x SU(2)r x U(1)s-L x P 
— SU(3). x SU (2) x U(1)Y , (4.160) 


and the subsequent breaking of the electroweak symmetry is achieved by 
the bi-doublet VEV. In general, 


(6) = Ea a a (4.161) 
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To get realistic SM gauge boson masses, the VEV’s of the bi-doublet Higgs 
must satisfy v? = |x|? + |k |? ~ 2M2/g? ~ (174 GeV)2. Generally, a non- 
vanishing VEV for the SU(2), triplet Higgs is induced, and it is suppressed 
by the heavy SU(2)r breaking scale similar to the see-saw mechanism for 
the neutrinos, 


0 0 
vrer 0 


< Az >= ( ) ; vLUR = lkl? , (4.162) 
where the parameter (@ is a function of the order O(1) coupling constants 
in the scalar potential and vr, vz, k and x’ are positive real numbers in 
the above equations. (The presence of a triplet Higgs in warped extra 
dimensions can provide a natural way to generate small Majorana masses 
for the neutrinos [56].) Due to this see-saw suppression, for a SU(2)r 
breaking scale as high as 1015 GeV, which is required by the smallness of 
the neutrino masses, the induced SU(2)r triplet VEV is well below the 
upper bound set by the electroweak precision constraints [57]. The scalar 
potential that gives rise to the vacuum alignment described can be found 
in Ref. [58]. 

The Yukawa sector of the model is given by Lyuk = Lq + Le, where 
Lq and Le are the Yukawa interactions in the quark and lepton sectors, 
respectively. The Lagrangian for quark Yukawa interactions is given by, 


-Lq = Q; p( Fig ® + Gij ®)Qj,1 + h.c. (4.163) 


where ® = T2®*T2. In general, Fj; and Gj; are Hermitian to preseve left- 
right symmetry. Because of our assumption of SCPV with complex vacuum 
expectation values, the matrices Fj; and Gj; are real. The Yukawa inter- 
actions responsible for generating the lepton masses are summarized in the 
following Lagrangian, Le, 


-Lı = Li (Pð + Rij ®)Lj,1 (4.164) 
+ ifig(LE,CrALL;,1 + Li pCr2ArL;,r) +h.c., 


where C is the Dirac charge conjugation operator, and the matrices P;;, Rij 
and fij are real due to the assumption of SCPV. Note that the Majorana 
mass terms LP Arlt and LTRARLj,r have identical coupling because 
the Lagrangian must be invariant under interchanging L + R. The com- 
plete Lagrangian of the model is invariant under the unitary transformation, 
under which the matter fields transform as 


Yr > Ury, wr — URYR (4.165) 
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where Yz r are left-handed (right-handed) fermions, and the scalar fields 
transform according to 


®—UrðUİ, A,—UtALU!, Ar—>URZARU}, (4.166) 


with the unitary transformations Uz and Up being 


et 0 eR 0 
Ur = ( 0 E D Ur = ( 0 a) ` (4.167) 


Under these unitary transformations, the VEV’s transform as 


Ko Ke OLTI), K! o K'On), (4.168) 


vp > vre L, UR URE R, 


Thus by re-defining the phases of matter fields with the choice of yR = ar/2 
and yL = Qk + apR/2 in the unitary matrices Uz and Up, we can rotate 
away two of the complex phases in the VEV’s of the scalar fields and are 
left with only two genuine CP violating phases, a, and ar, 


0 
<ð >= ( , e (4.169) 


0 0 0 0 
SeS ce a < Arg >= e i 


The quark Yukawa interaction £q gives rise to quark masses after the 
bi-doublet acquires VEV’s 


My, = KE; + ple nt Gij, Ma = Kl Ebr! Fy + KGij i (4.170) 


Thus the relative phase in the two VEV’s in the SU(2) bi-doublet, œw, gives 
rise to the CP violating phase in the CKM matrix. To obtain realistic quark 
masses and CKM matrix elements, it has been shown that the VEV’s of 
the bi-doublet have to satisfy k/k’ ~ m:/my >> 1 [59]. When the triplets 
and the bi-doublet acquire VEV’s, we obtain the following mass terms for 
the leptons 


Me = Kl Ebr! Pi; + KRij, M Deree = kb + Ke tn! Rij (4.171) 
ME? = vRfij, MEL = vrett fij. (4.172) 


The effective neutrino mass matrix, Ms", which arises from the Type-II 
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seesaw mechanism, is thus given by 


1 
Mest = MĮł! — MĮ = (se - SPP p+] UL , (4.173) 
MM MEA MPS) (4.174) 
= (KP + w/e" R) (vrf) (KP + w/e’ R) 
UL pT ¢—1 
PP Pes 
B f 
M} = vrei f. (4.175) 


Consequently, the connection between CP violation in the quark sector and 
that in the lepton sector, which is made through the phase a,-, appears 
only at the sub-leading order, O («’/«), thus making this connection rather 
weak. We will neglect these sub-leading order terms, and there is thus only 
one phase, az, that is responsible for all leptonic CP violation. 

The three low energy phases ô, @21, 431, in the MNS matrix are there- 
fore functions of the single fundamental phase, az. Neutrino oscillation 
probabilities depend on the Dirac phase through the leptonic Jarlskog in- 
variant, which is proportional to sin az, Je p xsinay. There are two ways 
to generate lepton number asymmetry. One is through the decay of the 
SU(2), triplet Higgs, A* — £+ £, and the corresponding asymmetry is 
given by, a e 

EARS fh -TAar >t) 
“~ TAs > +0) 41(A, > 240) 
The asymmetry can also be generated through the decay of the lightest RH 
neutrinos, N; — £+ Ht, and the asymmetry in this case is, 
= PO f+ AN) -T(M > 2+ #) (4.177) 
I(N, > L+ At)+T(N, > L+ H) 
Whether N; decay dominates or A; decay dominates depends upon if N4 is 
heavier or lighter than Az. As the mass of the triplet Higgs is typically at 
the scale of the LR breaking scale, it is naturally heavier than the lightest 
RH neutrino. As a result, Nı decay dominates. With the particle content 
of this model, there are three diagrams at one loop that contribute to 
leptogeiesis, as shown in Fig. 4.15. The contribution from diagram (a) and 
(b) mediated by charged lepton and Higgs doublet, which appear also in 
standard leptogenesis with SM particle content, is given by [60], 


(4.176) 


Im( M M! Mb) 

Ny _ 3 Mr, ( ze ) = 11 

r ae = a (4.178) 
167 \ v (MpM}5)i1 
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Fig. 4.15. Diagrams in the minimal left-right model that contribute to the lepton num- 
ber asymmetry through the decay of the RH neutrinos. 


Now, there is one additional one-loop diagram, Fig. 4.15 (c), mediated by 
the SU(2)z triplet Higgs. It contributes to the decay amplitude of the 
right-handed neutrino into a doublet Higgs and a charged lepton, which 
gives an additional contribution to the lepton number asymmetry [60], 


TE A (Ma)  Im(Mp(M}1)* M$) 
16r \ v2 (MpM})11 


where Mp is the neutrino Dirac mass term in the basis where the RH 
neutrino Majorana mass term is real and diagonal, 


€ , (4.179) 


Mp=0rMp, f“ =OrfOR. (4.180) 


Because there is no phase present in either Mp = Pr or M$ or Op, the 
quantity Mp (Mj)* Mz, is real, leading to a vanishing «%'. This state- 
ment is true for any chosen unitary transformations Uz and Up defined in 
Eq. (4.167). On the other hand, the contribution, e4*, due to the diagram 
mediated by the SU(2)p triplet is proportional to sin az. 

As all leptonic CP violation in this model come from one single origin, 
that is, the phase in the VEV of the LH triplet, (Az), strong correlation 
between leptogenesis and low energy CP violating processes can thus be 
established. In particular, both JE p and e are proportional to sina;. 

It has been found recently that, by lowering the left-right symmetry 
breaking scale with an additional U (1) symmetry, the link between CP 
violation in the quark sector and that in the lepton sector can also be 
established [61]. 


4.5. Recent Progress and Concluding Remarks 


Leptogenesis provides a very appealing way to generate the observed cosmo- 
logical baryonic asymmetry. It has gained a significant amount of interests 
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ever since the advent of the evidence of non-zero neutrino masses. In this 
scenario, the baryonic asymmetry is closely connected to the properties of 
the neutrinos, and the fact that the required neutrino mass scale for suc- 
cessful leptogenesis is similar to the scale observed in neutrino oscillations 
makes leptogenesis a very plausible source for the cosmological baryonic 
asymmetry. Even though there is so far no direct way to test leptogen- 
esis, the search for leptonic CP violation in neutrino oscillations at very 
long baseline experiments [66] and to look for lepton number violation in 
neutrinoless double beta decay will inevitably further the credibility of lep- 
togenesis as a source of the baryon asymmetry. 

The recent developments in the subject of leptogenesis have been fo- 
cused on the role of flavor. Recall that the total asymmetry given in Eq. 4.89 
have summed over all three flavor indices, 


a= X e, (4.181) 


a=e,p,T 


2 is the CP asymmetry in the a-flavor. Correspondingly, previ- 


where € 
ous solutions to the Boltzmann equations have summed over all the three 


flavors, e, u, T, and thus they did not include flavor dependence [62], 


d(Yn, oa YN) z Yn, 
= -_ ey eel 4.182 
dz OAR E ) 184) 
ae 
dz” 
dYL Z Yn, 
— =—-_|( 3-1 4.1 
dz sH(M,) (7 €1YD ( 83) 
Y; 
aes ya (YDYAat=1 + a) ; 
L 


where Yy, and Yz are the number density of the lightest right-handed 
neutrino N; and of the lepton number asymmetry, respectively, and y’s are 
the decay rates for the processes specified in the subscripts. It has recently 
been pointed out that flavor effects matter if heavy neutrino masses are 
hierarchical [62]. The Yukawa interactions of all three flavors, e, u and 
T, reach equilibrium at different temperatures. These temperatures are 
determined by the size of the Yukawa couplings, A, as 


X?’ Mpi = Tez - (4.184) 


Due to the relative large coupling constant, the r Yukawa interactions reach 
equilibrium at T ~ 10!% GeV, while the muon Yukawa interactions reach 
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equilibrium at T ~ 10° GeV. If leptogenesis takes place at T ~ Mı > 10! 
GeV, the Yukawa interactions of all three lepton flavors are out of equilib- 
rium, and hence the three flavors are indistinguishable. In particular, the 
washout factor is universal for all three flavors. However, if leptogenesis 
takes place at temperature below 101% GeV, which is generally the case for 
hierarchical right-handed neutrino masses, the three flavors are distinguish- 
able and thus their effects should be included in the Boltzmann equations 
properly. Instead of a single evolution function for Yz as given in Eq. 4.183, 
one should consider the evolution of the lepton number asymmetry, Y°°, 
which is due to the decay of the lightest right-handed neutrino into charged 
lepton of flavor a with the corresponding asymmetry given by e°“ and 
decay rate given by y@* [62], 


dye“ 7 Yn, 
2 — 1 |) et (yom = 4.185 
dz sH(M,) E )e (QD + Yar=1) (ATR 
yea aa 
z Yg (Vp + Yar=1)| , 


Note that in the above equation, there is no summation over the flavor in- 
dex, a. By properly including the flavor effects, the amount of leptogenesis 
may be enhanced by a factor of 2 to 3 [62]. 

Except for the specific types of models [52, 54] discussed in Sec. 4.4, the 
general lack of connection between leptogenesis and low energy CP viola- 
tion translates into the fact that the observation of the leptonic Dirac or 
Majorana phases at low energy does not imply non-vanishing leptogenesis. 
This statement is weakened in a framework when the right-handed neutrino 
sector is CP invariant and when the flavor effects are important [63]. This 
is elucidate by introducing the “orthogonal parametrization” for neutrino 
Dirac Yukawa matrix [64], 


h = İM"? Rm"/2Ut ; (4.186) 
U 


where m = diag(m1, m2, msg) is the diagonal matrix of the light neutrino 
masses, M is the diagonal matrix of the right-handed neutrino masses and U 
is the MNS matrix. The orthogonal matrix R is defined by this equation as 
R=vM~‘/?hUm-‘/?, In the basis where the right-handed neutrino mass 
matrix and the charged lepton mass matrix are diagonal, the neutrino Dirac 
Yukawa matrix can be written as h = VřÍdiag(hı, h2, h3)Vř. Therefore, 
the low energy CP violation in the lepton sector can arise from either the 
left-handed sector through V/’, the right-handed sector through Vz, or from 


both. From hhtv? = Vý 'diag(h?, h2, h2)V Zu? = M'/2RmRtM"?, it can 
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be seen that the phases of R are related to those in the right-handed sector 
through Vg. The asymmetry €; given in Eq. 4.89, which is derived with 
one-flavor approximation, can be rewritten as follows [65], 


3M, In(Xm3R,) 
16rv? dia MalRial? ` 


a= (4.187) 
Assuming the right-handed sector is CP invariant, low energy CP phases 
can then arise entirely from the left-handed sector and thus are irrelevant for 
€1, which vanishes because the orthogonal matrix R is real. If leptogenesis 
takes place at T < 101? GeV, the flavor effects must be taken into account. 
In this case the asymmetry in each flavor is given by [65], 


___ 3M, m(Z p, mg m UzgUao Rig Ray) (4.188) 
on 16rv? yy me|Rip|? ` ` 


The contribution of each of these individual asymmetries to the total asym- 
metry is then weighted by the corresponding washout factor. Therefore, 
barring accidental cancellations, the presence of the MNS matrix elements 
in Eq. 4.188 signifies the need for low energy CP violation in order to 
have leptogenesis. Hence if leptonic CP violation in neutrino oscillations is 
observed at future very long baseline experiments [66] and if lepton num- 
ber violation is established by observing neutrinoless double beta decay, it 
would even more strongly suggest than it has been that leptogenesis be the 
source for the origin of the cosmological baryon asymmetry. 

Finally, a fundamental problem in the current treatment of leptogen- 
esis is the fact that the Boltzmann equations utilized in the present cal- 
culations are purely classical treatment. However, the collision terms are 
zero-temperature S-matrix elements which involve quantum interference. 
In addition, the time evolution of the system should be treated quantum 
mechanically. These lead to the need of quantum Boltzmann equations 
which is based on Closed-Time-Path (CTP) formalism [67]. A more de- 
tailed discussion on this issue can be found in Refs. [6, 68]. 
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Neutrino Experiments 


J. M. Conrad 


Department of Physics, 
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New York, NY 


This article is a summary of four introductory lectures on “Neutrino Ex- 
periments,” given at the 2006 TASI summer school. The purpose was to 
sketch out the present questions in neutrino physics, and discuss the exper- 
iments that can address them. The ideas were then explored in depth by 
later lecturers. 

This article begins with an overview of neutrinos in the Standard Model 
and what we know about these particles today. This is followed by a dis- 
cussion of the direction of the field, divided into the three themes identified 
in the APS Study on the Future of Neutrino Physics.| This APS study rep- 
resented the culmination of a year-long effort by the neutrino community 
to come to a consensus on future directions. The report is recommended 
reading for students, along with the accompanying working group white 
papers, especially the Theory Group Whitepaper.” 

While these lectures used the APS Neutrino Study themes as the core, 
the emphasis here is different from the APS report. The point of a summer 
school is to teach specific ideas rather than provide a perfectly balanced 
overview of the field. The result is that, with apologies, some experiments 
were necessarily left out of the discussion. Students are referred to the 
Neutrino Oscillation Industry Website? for a complete list of all neutrino 
experiments, by category. 


5.1. Neutrinos As We Knew Them 


Neutrinos are different from the other fermions. Even before the recent 
evidence of neutrino mass, neutrinos were peculiar members of the Standard 
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Model. They are the only fermions 


e to carry no electric charge. 
e for which we have no evidence of a right-handed partner. 
e that are defined as massless. 


These ideas are connected by the fact that, unlike other spin 1/2 particles, 
neutrinos can only interact through the weak interaction. 

Even though the Standard Model picture is now demonstrably wrong, 
this theoretical framework provides a good place to start the discussion. 
This section begins by expanding on the Standard Model picture of the 
neutrino sketched above. It then discusses how neutrinos interact. This is 
followed by an overview of neutrino sources and detectors. 


5.1.1. Neutrinos in the Standard Model 


Neutrinos are the only Standard Model fermions to interact strictly via 
the weak interaction. This proceeds through two types of boson exchange. 
Exchange of the Z° is called the neutral current (NC) interaction. Exchange 
of the W= is called the charged current (CC) interaction. When a W is 
emitted, charge conservation at the vertex requires that a charged lepton 
exits the interaction. We know the family of an incoming neutrino by the 
charged partner which exits the CC interaction. For example, a scattered 
electron tags a Ve interaction, a u tags a v, interaction, and a T tags a 
v, interaction. The neutrino always emits the Wt and the antineutrino 
always emits the W~ in the CC interaction. In order to conserve charge at 
the lower vertex, the CC interaction is flavor-changing for target quarks. 
For example, in a neutrino interaction, if a neutron, n, absorbs a WT, a 
proton, p, will exit the interaction. The W has converted a d quark to a u 
quark. The first two diagrams shown on Fig. 5.1 illustrate a NC and a CC 
interaction, respectively. 

In 1989, measurements of the Z° width at LEP* and SLD® determined 
that there are only three families of light-mass weakly-interacting neutrinos, 
although we will explore this question in more depth in section 3 of these 
lectures. These are the ve, the v,, and the v+. The interactions of the 
ve and v, have been shown to be consistent with the Standard Model 
weak interaction. Until recently, there has only been indirect evidence for 
the v, through the decay of the r meson. In July 2000, however, the 


DoNuT Experiment (E872) at Fermilab presented direct evidence for v, 
6 


interactions. 
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Fig. 5.1. Examples of the four types of neutrino interactions which appear throughout 
this discussion and are defined in Sec. 5.1.2. The first two diagrams show an NC and 
CC interaction, respectively. 


Within the Standard Model, neutrinos are massless. This assumption is 
consistent with direct experimental observation. It is also an outcome of the 
feature of “handedness” associated with neutrinos. To understand hand- 
edness, it is simplest to begin by discussing “helicity,” since for massless 
particles helicity and handedness are identical. 

For a spin 1/2 Dirac particle, helicity is the projection of a particle’s 
spin (©) along its direction of motion p, with operator X . p. Helicity has 
two possible states: spin aligned opposite the direction of motion (negative, 
or “left helicity” ) and spin aligned along the direction of motion (positive or 
“right helicity”). If a particle is massive, then the sign of the helicity of the 
particle will be frame dependent. When one boosts to a frame where one is 
moving faster than the particle, the sign of the momentum will change but 
the spin will not, and therefore the helicity will flip. For massless particles, 
which must travel at the speed of light, one cannot boost to a frame where 
helicity changes sign. 

Handedness (or chirality) is the Lorentz invariant (i.e., frame- 
independent) analogue of helicity for both massless and massive Dirac par- 
ticles. There are two states: “left handed” (LH) and “right handed” (RH). 
For the case of massless particles, including Standard Model neutrinos, he- 
licity and handedness are identical. A massless fermion is either purely 
LH or RH, and, in principle, can appear in either state. Massive particles 
have both RH and LH components. A helicity eigenstate for a massive 
particle is a combination of handedness states. It is only in the high en- 
ergy limit, where particles are effectively massless, that handedness and 
helicity coincide for massive fermions. Nevertheless, people tend to use the 
terms “helicity” and “handedness” interchangeably. Unlike the electromag- 
netic and strong interactions, the weak interaction involving neutrinos has 
a definite preferred handedness. 
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In 1956, it was shown that neutrinos are LH and outgoing antineutrinos 
are RH.” This effect is called “parity violation.” If neutrinos respected 
parity, then an equal number of LH and RH neutrinos should have been 
produced in the 1956 experiment. The fact that all neutrinos are LH and 
all antineutrinos are RH means that, unlike all of the other fermions in the 
Standard Model, parity appears to be maximally violated for this particle. 
This is clearly very strange. 

We need a method to enforce parity violation within the weak interac- 
tion theory. To this end, consider a fermion wavefunction, ~, broken up 
into its LH and RH components: 


Y = YL + dr. (5.1) 
We can introduce a projection operator which selects out each component: 
YPL, R = FUL,R- (5.2) 


To force the correct handedness in calculations involving the weak interac- 
tion, we can require a factor of (1 — y°)/2 at every weak vertex involving a 
neutrino. As a result of this factor, which corresponds to the LH projection 
operator, we often say the charged weak interaction (W exchange) is “left 
handed.” 

Note that by approaching the problem this way, RH neutrinos (and LH 
antineutrinos) could in principle exist but be undetected because they do 
not interact. They will not interact via the electromagnetic interactions 
because they are neutral, or via the strong interaction because they are 
leptons. RH Dirac neutrinos do not couple to the Standard Model W, 
because this interaction is “left handed,” as discussed above. Because they 
are non-interacting, they are called “sterile neutrinos.” By definition, the 
Standard Model has no RH neutrino. 

With no RH partner, the neutrino can have no Dirac mass term in 
the Lagrangian. To see this, note that the free-particle Lagrangian for a 
massive, spin 1/2 particle is 


L = hyp 3p — maby. (5.3) 
However, Yy can be rewritten using 
Yr, r = 1/21 F 5, (5.4) 
Yr R = 1/21 +7"), 5.5 
giving 
by = gl- sat ++ — $ ei . lu =Udat devs. (5.6) 
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In other words, an may (“mass”) term in a Lagrangian mixes RH and 
LH states of the fermion. If the fermions have only one handedness (like 
vs), then the Dirac mass term will automatically vanish. In the Standard 
Model, there is no Dirac mass term for neutrinos. 


5.1.2. Neutrino Interactions 


Neutrino interactions in the Standard Model come in four basic types. Fig- 
ure 5.1 shows examples of the four interactions. In Elastic scattering, “what 
goes is what comes out,” just like two billiard balls colliding. An example 
is a NC interaction where the target is does not go into an excited state or 
break up, e.g., Ve +n — ve +n. A more complicated example is electron- 
neutrino scattering from electrons, where the W exchange yields a final 
state which is indistinguishable from the Z exchange on an event-by-event 
basis, so this is categorized as an elastic scatter. Quasi-elastic scattering is, 
generally, the CC analogue to elastic scattering. Exchange of the W causes 
the incoming lepton and the target to change flavors, but the target does 
not go into an excited state or break apart. An example is v, +n — u+ p. 
Single pion production may be caused by either NC or CC interactions. 
In resonant single pion production, the target becomes a A which decays 
to emit a pion. In coherent scattering, there is little momentum exchange 
with the nucleon and a single pion is produced diffractively in the forward 
direction. The case of NC single 7° production is particularly important, 
because this forms a background in many neutrino oscillation searches. Fi- 
nally, DIS, or Deep Inelastic Scattering, is the case where there is large 
4-momentum exchange, breaking the nucleon apart. One can have NC or 
CC deep inelastic scattering. 

Figure 5.2 summarizes the low energy behavior of o/E for CC events 
(solid line), as predicted by the NUANCE neutrino event generator.? The 
quasi-elastic, single pion and deep inelastic contributions are indicated by 
the broken curves. The data indicate the state of the art for neutrino 
cross section measurements. One can see that if precision neutrino studies 
are to be pursued in the MeV to few GeV range, that more accurate mea- 
surements are essential. The MiniBooNE,!° SciBooNE,!! and MINERvA™ 
experiments are expected to improve the situation in the near future. 

Above a few GeV, the total neutrino cross section rises linearly with 
energy. The total cross section is the sum of many partial cross sections: 
quasi-elastic + single pion + two pions + three pions + etc. As the energy 
increases, each of these cross sections sequentially “turns on” and then 
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Fig. 5.2. Current status of vą CC cross section measurements in the 1 to 100 GeV range. 
This plot shows o/E, thus removing the linear energy dependence at high energies. Note 
the low energy cut-off due to the muon mass suppression. Components of the total cross 
section are indicated by the curves.® 


becomes constant with energy. Thus the sum, which is the total cross 
section, increases continuously and linearly with E. 

Nevertheless, even at high energies, this interaction is called “weak” 
for good reason. The total cross section for most neutrino scattering ex- 
periments is small. For 100 GeV vy, interactions with electrons, the cross 
section is ~ 10~4° cm?. For 100 GeV v, interactions with nucleons, the 
cross section is ~ 10736 cm?. This is many orders of magnitude less than 
the strong interaction. For example, for pp scattering, the cross section is 
~ 107? cm?. The result is that a 100 GeV neutrino will have a mean free 
path in iron of 3 x 109 meters. Thus most neutrinos which hit the Earth 
travel through without interacting. It is only at ultra-high energies that 
the Earth becomes opaque to neutrinos, as discussed in Sec. 5.3.3.2. 

In principle, the interactions of the ve, Vu, and v, should be identical 
(“universal”). In practice, the mass differences of the outgoing leptons lead 
to considerable differences in the behavior of the cross sections. In the 
CC interaction, you must have enough CM energy to actually produce the 
outgoing charged lepton. Just above mass threshold, there is very little 
phase space for producing the lepton, and so production will be highly 
suppressed. The cross section increases in a non-linear manner until well 
above threshold. Consider, for example, a comparison of the ve and v, CC 
quasielastic cross section on carbon, shown in Fig. 5.3. At very low energy 
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Fig. 5.3. The ratio of the ve to v, CC cross sections as a function of neutrino energy, 
showing the suppression due to the lepton mass. 


the CC v, cross section is zero, while the ve cross section is non-zero, 
because the 105 MeV muon cannot be produced. The ratio approaches one 
at about 1 GeV. A similar effect occurs for the v, CC interaction cross 
sections. The mass of the 7 is 1.8 GeV, resulting in a cross section which is 
zero below 3.5 GeV and suppressed relative to the total v, CC scattering 
cross section for v, beam energies beyond 100 GeV. At 100 GeV, which 
corresponds to a center-of-mass energy of V2ME ~ 14 GeV, there is still 
a 25% reduction in the total CC v, interaction rate compared to v,, due to 
leptonic mass suppression. 

For low energy neutrino sources, the CC interaction may also be sup- 
pressed due to conversion of the nucleon at the lower vertex. For ex- 
ample, the CC interaction commonly called “inverse beta decay” (IBD), 
Dep — etn, which is crucial to reactor neutrino experiments, has a thresh- 
old of 1.084 MeV, driven by the mass difference between the proton and 
the neutron plus the mass of the positron. In the case of bound nuclei, the 
energy transferred in a CC interaction must overcome the binding energy 
difference between the incoming and outgoing nucleus as well as the mass 
suppression due to the charged lepton. This leads to nuclear-dependent 
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Table 5.1. Reactions from the Sun producing neutrinos. 
Common Terminology Reaction 

“pp neutrinos” 

“nep neutrinos” 

“7 Be neutrinos” 


“8B neutrinos” 
“hep neutrinos” 


3He + p * He + et + ve 


thresholds for the CC interaction. For example: 


35C1(75.8%) =?” Ar: 5.967 MeV; 
37C](24.2%) 3” Ar: 0.813 MeV; 
®°Ga(60.1%) °° Ge: 2.227 MeV; 
1Ga(39.9%) ="! Ge: 0.232 MeV. 


are the thresholds for isotopes which have been used as targets in past solar 
neutrino (ve) detectors.1* 16 

In discussing neutrino scattering at higher energies, several kinematic 
quantities are used to describe events. The squared center of mass energy is 
represented by the Mandelstam variable, s. The energy transferred by the 
boson is v, and y = v/E, is the fractional energy transfer, or “inelasticity.” 
The distribution of events as a function of y depends on the helicity. For 
neutrino scattering from quarks, the y-dependence is flat, but for antineu- 
trinos, the differential cross section is peaked at low y. The variable Q? 
is the negative squared four-momentum transfer. Deep inelastic scattering 
begins to occur at Q? ~ 1 GeV?. If x is the fractional momentum carried 
by a struck quark in a deep inelastic scatter, then x = Q?/2Mv, where 
M is the target mass. Elastic and quasieleastic scattering occur at x = 1, 
hence Q? = 2Mv & sgy, valid for large s. 


5.1.3. Sources of Neutrinos 


With such a small interaction probability, it is clear that intense neutrino 
sources are needed to have high statistics in a neutrino experiment. The 
primary sources of neutrino for interactions observed on Earth are the Sun, 
cosmic-ray interactions, reactors, and accelerator beams. 

At present, there are two intense sources in the few MeV range that allow 
for low energy neutrino interaction studies. First, the interactions in the 
Sun produce a pure ve flux, as listed in Table 5.1. The energy distribution of 
neutrinos produced by these reactions is shown in Fig. 5.4. The sensitivity 
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Fig. 5.4. The flux predicted by the Standard Solar Model.!? The sensitivity of past 
solar neutrino detectors varies due to CC threshold in the target material.41° The 
thresholds for various experiments is shown at the top of the plot. 


of various solar neutrino experiments, due to the CC threshold, is shown at 
the top of the figure. There is no observable antineutrino content. The best 
limit on the solar neutrino De/Ve ratio for E, > 8.3 MeV is 2.8x 107+ at 90% 
CL.'’ The second source is from reactors. In contrast to the Sun, reactors 
produce a nearly pure flux. The energy peaks from ~ 3 to 7 MeV. 
Neutrinos from ( decay of accelerated isotopes could, in principle, represent 
a third intense source of neutrinos in the MeV range (or higher), once the 
technical issues involved in designing such an accelerator are overcome. 
Such a“beta beam” would produce a very pure ve or De beam, depending 
on the accelerated isotope.!® 

At present, higher energy experiments use neutrinos produced at accel- 
erators and in the atmosphere. In both cases, neutrinos are dominantly 
produced via meson decays. In the atmospheric case, cosmic rays hit at- 
mospheric nuclei producing a shower of mesons which may decay to neutri- 
nos along their path through the atmosphere to Earth. In a conventional 
neutrino beam, protons impinge on a target, usually beryllium or carbon, 
producing secondary mesons. In many experiments, the charged mesons 
are focussed (bent) toward the direction of the experiment with a magnetic 


186 J. M. Conrad 


Table 5.2. Common sources of neutrinos in atmospheric and accelerator experiments. 
2-body pion decay 
2-body kaon decay 

muon decay 
Ke3 decay 


TT >My, Tw Fh Dy 
KY —> pvp, KO > W Du 
pt —> et Dyve, HT — e7 VpDe 
K+ — Tet Ve, K- = Te De, Ko. nT etve, K? => nte De 


device called a horn. These devices are sign-selecting — they will focus one 
charge-sign and defocus the other — and so produce beams which are dom- 
inantly neutrinos or antineutrinos depending on the sign-selection. The 
beamline will have a long secondary meson decay region, which may be air 
or vaccuum. This is followed by a beam dump and an extended region of 
dirt or shielding to remove all particles except neutrinos. There are excel- 
lent reviews of methods of making accelerator-produced neutrino beams.!® 
Table 5.2 summarizes the common sources of neutrino production in the 
atmosphere and conventional accelerator based beams. 

Many atmospheric and accelerator-based neutrino experiments are de- 
signed to study 100 MeV to 10 GeV neutrinos. The atmospheric neutrino 
flux drops as a power-law with energy, and the 1 to 10 GeV range dominates 
the event rate. Accelerator beams can be tuned to a specific energy range 
and, using present facilities, can extend to as high as 500 GeV. From the 
viewpoint of sheer statistics, one should use the highest energy neutrino 
beam which is practical for the physics to be addressed, since the cross 
section rises linearly with energy. However, lower neutrino energy beams, 
from ~ 1 to 10 GeV, are typically used for oscillation experiments. In these 
experiments, having a cleanly identified lepton in a low multiplicity event 
trumps sheer rate, and so ~ 1 GeV beams are selected to assure that CCQE 
and single pion events dominate the interactions. 

Both atmospheric and accelerator based neutrino sources are domi- 
nantly v,,-flavor. The main source of these neutrinos is pion decay. To 
understand why pions preferentially decay to produce vy, rather than ve, 
consider the case of pion decay to a lepton and an antineutrino: 7~ — 07 De. 
The pion has spin zero and so the spins of the outgoing leptons from the de- 
cay must be opposite from angular momentum conservation. In the center 
of mass of the pion, this implies that both the antineutrino and the charged 
lepton have spin projected along the direction of motion (“right” or “posi- 
tive” helicity). However, this is a weak decay, where the W only couples to 
the RH antineutrino and the LH component of the charged particle. The 
amplitude for the LH component to have right-helicity is proportional to 
m/E. Thus it is very small for an electron compared to the muon, pro- 
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ducing a significant suppression for decays to electrons. Calculating the 
expected branching ratios: 


I(a* > e* ve) 


Rtheory = T(n= = p=»,) (5.7) 
2 2 m2 2 
= (=) Ga Me) (5.8) 
Mu ‘Mr? — My 
= 1.23 x 1074; (5.9) 


This compares well to the data:?? Resp = (1.230 + 0.004) x 1074. 
The above discussion assumed the neutrino was massless. If the neu- 
trino is massive, then it too can be produced with wrong helicity with an 


amplitude proportional to m,/E and thus a probability proportional to 
(m,/E)?. As discussed in Sec. 5.2.2, below, neutrino mass is limited to be 
very small (~ eV) and thus the rate of wrong-helicity neutrino production 
is too low a level for any chance of observation in the near future. 

Depending on the energy, there may also be significant neutrino produc- 
tion from kaon decays. The charged kaon preferentially decays to the v, 
for the same reason as the charged pion. However, for equal energy mesons, 
the kinematic limit for a neutrino from Kt decay is much higher than for 
n+ decay: EK = 0.98Ex compared to E™*" = 0.43E,. Thus the 
neutrinos from kaon decays can be isolated by studying the highest energy 
component of a beam. Figure 5.5 shows the contributions of pion and kaon 
decays to the v, flux in the MiniBooNE experiment, which uses an 8 GeV 
primary proton beam. 

Electron neutrino flavors are produced in these beams through kK — 
mee (called “Ke3”) decay and through the decay of the muons which were 
produced in the pion decay. These are three-body decays which avoid 
substantial helicity suppression. Helicity does, however, affect the energy 
spectrum of the outgoing decay products. In an accelerator-based experi- 
ment, the level of electron-flavor content can be regulated, at some level, by 
the choice of primary beam energy and the length of the decay region. A 
low primary beam energy will suppress kaon production because of the rela- 
tively high mass of this meson (494 MeV). A short decay pipe will suppress 
ve from u decay, which tends to occur downstream, because it is produced 
in a multi-step decay chain (7 —> u —> ve). Both of these methods of sup- 
pressing Ve production also lead to a reduction in the v, production rate, 
so an experimenter must balance competing goals in the beam design. In 
the case of atmospheric neutrinos, the ratio is roughly 2:1 for v,:7¢, though 
the fraction ve’s changes with energy (see Fig. 5.6). The atmospheric flux 
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v, from K decay 
Ve from n & u decay 
Ve from K decay 


Neutrinos/POT/50MeV at MiniBooNE Tank 


0 0.5 1 1.5 2 25 3 3.5 
E, (GeV) 


Fig. 5.5. The contributions from pion and kaon production to the total predicted vy, 
flux in the MiniBooNE experiment. The spikes at low energy in the K-produced fluxes 
are due to decays of stopped kaons in the beam dump. 1° 


depends on the location of the detector because charged particle are bent 
by the Earth’s magnetic field. The variation between fluxes at the Kamioka 
mine in Japan and Soudan mine in Minnesota are shown in Fig. 5.6. 

As we move to a precision era in neutrino physics, precise “first- 
principles” predictions of the flux are becoming very important. For conven- 
tional accelerator-based neutrino beams and for the atmospheric flux, this 
requires well-measured cross-sections for production of secondary pions and 
kaons. This has motivated a range of secondary production experiments. 
The kinematic coverage is shown on Fig. 5.7. 

The future of high intensity v, and ve beams is likely to lie in beams 
produced from muon decay. Because of the potential for very high intensity, 
these beams are called “Neutrino Factories.” The concept is very attractive 
because it produces beams which are very pure v, and De from u~ and 
vice versa from p*. Each flavor has no “wrong sign” (antineutrino-in- 
neutrino-beam or neutrino-in-antineutrino beam) background. However, 
neutrino factory designs!® necessarily produce high energy neutrinos, since 
the muons must be accelerated to high energies in order to live long enough 
to be captured and circulated in an accelerator. The Neutrino Factory is 
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Fig. 5.6. The variation in the atmospheric neutrino flavor content as a function of 
energy for two locations, Japan (solid line, open circles) and Minnesota (dashed line, 
closed squares). The points are from a full 3-dimensional monte carlo of the flux, while 
the histograms are from a simpler model.?! 


seen as a promising first machine for testing ideas for a muon collider,'® 
and thus has attracted interest beyond the neutrino community. 

A beam enriched in v, can be produced by impinging very high energy 
protons on a target to produce D,-mesons which are sufficiently massive 
to decay to T vy. The 7 lepton is very massive, at 1.8 GeV, compared to 
the muon, at 106 MeV, and thus helicity considerations for the D, decay 
strongly favor the T v, mode compared to u Vu, by a ratio of about 10:1. 
The 7 then subsequently decays, also producing v;s. 

Unfortunately, because of the short lifetime, it is not possible to separate 
Ds, mesons from the other mesons prolifically produced by the primary 
interaction. As a result, the beam is dominated by the v,s produced by 
decays of other mesons. To reduce the production of v,,, experiments use 
a “beam dump” design where protons hit a very thick target where pions 
can be absorbed before decaying. The only enriched-v,; beam created to 
date was developed by DoNuT.® They used an 800 GeV proton on a beam 
dump, to produce a ratio of ve:Vu:Vr of about 6:9:1. 


190 J. M. Conrad 


~ 3 T T T TTITtit T T T T TTT. T T TITIM F 
o 10 E- wt 
> E SPY |”. 
D jE 
Po} C 
= | Atherton et al. 
a i ae E 
[o FNAL oe EEEE 
Serpukhov og Bal Mou cee 
as P E de 
"r E E oo o 8 
m Allaby et al. of EEE A Ea 
Eichten et al. o oo oe oOo ea ooon a 
yeo 2 ogo nn 000 0 oo n an 
F ar opooOoOOaeoad noaa 
L Abbott et al. “ Oo mooooOoOoe goo w 
oo Of OOaoaoa a oo 8 a 
Oan (mi Bouooon a « o o ono on 
5 oe oporna a oo = @ 
10 Cho et al. a Ete JAn a eee a i i 
E S a HOoOMoOoooOooes oo = @ 
fe - Bon mimma mingga Be dee ake S 
L ‘8 OWOOOMoOooocaass ab aves 
SUG Fr IOooooooooooe es ee 
» OCG [JOO Oooeoasae oo: 
L i o QQ0QO0O00000mn09%2% è 
-oO o 0Qi0000o000no0sass.-- 
a : HHpogopnans a. e BE de cae, hate a 
1 ae GOGO 0CRboooes s s ee s> s 
3 ao MOOWODeeebo ese eto te s> eee 
"00 oonnojoOononn0n0onanaas as ’‘s’sa’e ‘a’ 
C = 2 8 oo oO E 8 ojoon o no n oo Be 
[e a HARPA ----- a 
10! L J i Lori 1 i iiti i J jt IJJ 
2 
1 10 10 


Peru (GeV/c) 


Fig. 5.7. The kinematic range covered by recent experiments measuring secondary pion 
and kaon production.?# 


5.1.4. Typical Neutrino Detectors 


Because neutrinos interact so weakly, the options for detectors are limited 
to designs which can be constructed on a massive scale. There are sev- 
eral general styles in use today: unsegmented scintillator detectors, unseg- 
mented Cerenkov detectors, segmented scintillator-and-iron calorimeters, 
and segmented scinitillator trackers. The most promising future technol- 
ogy is the noble-element based detector, which is effectively an electronic 
bubble chamber. Liquid argon detectors are likely to be the first large-scale 
working example of such technology. There are a few variations on these 
five themes, which are considered in later sections in the context of the 
measurement. 
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Unsegmented scintillator detectors are typically used for low energy an- 
tineutrino experiments. Recent examples include Chooz,” KamLAND?6 
and LSND.’ These consist of large tanks of liquid scintillator surrounded 
by phototubes. Usually the scintillator is oil based, hence the target ma- 
terial is CHə and its associated electrons. Often the tubes are in an pure 
oil buffer. This reduce backgrounds from radiation emitted from the glass 
which would excite scintillator. The free protons in the oil provide a target 
for the interaction, Pep — e*n, which is the key for reactor experiments. 
The reaction threshold for this interaction is 1.806 MeV due to the mass 
differences between the proton and neutron and the mass of the positron. 
The scintillation light from the e*, as well as light from the Compton scat- 
tering of the 0.511 MeV annihilation photons provide an initial (“prompt” ) 
signal. This is followed by n capture to produce deuterium and a 2.2 MeV. 
This sequence — positron followed by neutron capture — provides a clean 
signal for the interaction. Doping the liquid scintillator with gadolinium 
substantially increases the neutron capture cross section as well as the vis- 
ible energy produced in the form of gammas upon neutron capture. 

Unsegmented scintillator detectors are now being introduced for low 
energy solar neutrino measurements at Borexino,?? KamLAND?? and 
SNO+.°° These provide energy information on an event-by-event. basis, 
unlike most past solar neutrino experiments, such as Homestake,'4 SAGE! 
and GallEx,!® which intergrated over time and energy. However, these are 
very difficult experiments to perform because a neutron is not produced 
and so the scattering does not produce a two-fold coincidence, but only a 
prompt flash of light. 

Environmental backgrounds are by far the most important issue in low 
energy experiments. These fall into two categories: naturally occurring 
radioactivity and muon-induced backgrounds. To get a sense for what is 
expected, Fig. 5.8 shows the visible energy distribution of singles events 
from the KamLAND experiment with the sources of environmental back- 
ground identified. The naturally occurring radioactive contaminants mainly 
populate the low energy range of Fig. 5.8, with isotopes from the U and Th 
chain extending to the highest energies. These isotopes must be kept un- 
der control by maintaining very high standards of cleanliness. The second 
source of environmental background, the -decays of isotopes produced by 
cosmic ray muons. These dominate the background for Eye > 4 MeV 
(see Fig. 5.8). These can only be eliminated by shielding the detector from 
cosmic rays. As a result, we must build deep underground laboratories with 
many thousands of meters-water-equivalent (“mwe”) of rock shielding. 
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Fig. 5.8. Energy distribution and sources of singles events in KamLAND as a function 
of visible energy." 


In these scintillator detectors, the CC interaction with the carbon in 
the oil (which produces either nitrogen or boron depending on whether the 
scatterer is a neutrino or antineutrino) has a significantly higher energy 
threshold than scattering from free protons. ve + C — e +N has a 
threshold energy of 13.369 MeV, which arises from the carbon-nitrogen 
mass difference (plus the mass of the electron). In the case of both reactor 
and solar neutrinos, the flux cuts off below this energy threshold. 

Existing unsegmented Cerenkov detectors include MiniBooNE,*? Super 
K,’ and AMANDA.** These detectors make use of a target which is a 
large volume of a clear medium (undoped oil, water and ice, respectively) 
surrounded by or interspersed with phototubes. Undoped oil has the ad- 
vantages of a larger refractive index, leading to larger Cerenkov opening 
angle, and of not requiring a purification system to remove living organ- 
isms. Water is the only affordable medium once a detector is larger than 
a few ktons. For ultra-high energy neutrino experiments, a vast natu- 
ral target is needed. Sea water? and ice*4 have been used. Ice is, to 
date, more successful because it does not suffer from backgrounds from 
bioluminescence. 

In most cases of these detectors, the tubes surround the medium and 
the projected image of the Cerenkov ring is used for particle identification. 
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Fig. 5.9. An example of a muon ring (left) and electron ring (right) in the Super K 
Cerenkov detector.?? 


To understand how this works, first consider the case of a perfect, short 
track. This will project a ring with a sharp inner and outer edge onto 
the phototubes. Next consider an electron produced in a ve CC quasielas- 
tic interaction. Because the electron is low mass, it will multiple scatter 
and easily bremsstrahlung, smearing the light projected on the tubes and 
producing a “fuzzy” ring. A muon produced by a CC quasileastic v„ in- 
teraction is heavier and thus will produce a sharper outer edge to the ring. 
For the same visible energy, the track will also extend farther, filling the in- 
terior of the ring, and perhaps exit the tank. Fig. 5.9 compares an electron 
and muon ring observed in the Super K detector. If the muon stops within 
the tank and subsequently decays, the resulting “michel electron” provides 
an added tag for particle identification. In the case of the u~, 18% will 
capture in water, and thus have no michel electron tag, while only 8% will 
capture in oil. 

Scintillator and iron calorimeters provide affordable detection for v, in- 
teractions in the range of ~1 GeV and higher. Recent examples include the 
MINOS*6 and NuTeV*" experiments. In these detectors, the iron provides 
the target, while the scintillator provides information on energy deposition 
per unit length. This allows separation between the hadronic shower, which 
occurs in both NC and CC events, and the minimum ionizing track of an 
outgoing muon, which occurs in CC events. Transverse information can 
be obtained if segmented scintillator strips are used, or if drift chambers 
are interspersed. The light from scintillator strips is transported to tubes 
by mirrored wave-length-shifting fibers. Transverse information improves 
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Fig. 5.10. A CCQE(vy, +n —> u + p) event observed in the SciBooNE detector. The 
long, minimum-ionizing red track is identified as the muon, the short, heavily-ionizing 
red track is identifed as the proton.!! 


separation of electromagnetic and hadronic showers. The iron can be mag- 
netized to allow separation of neutrino and antineutrino events based on 
the charge of the outgoing lepton. 

In all three of the above detector designs, it is difficult to reconstruct 
multi-particle events. Tracking is not an option for an unsegmented scin- 
tillator detector. Cerenkov detectors can typically resolve two tracks per 
event. Segmented calorimeters reduce multiple hadrons to a shower, ob- 
scuring any track-by-track information other than from muons. 

To address the problem of track reconstruction in low energy (< 1 GeV), 
low multiplicity events, there has been a move toward all-scintillator track- 
ing detectors. This began with the SciBar detector in K2K.°8 This detector 
used scintillator strips, as in MINOS, but without interspersing iron. As a 
result, low energy (few MeV) tracks were clearly observable and quasielastic 
and single pion events could be fully reconstructed. SciBar has since been 
incorporated into the SciBooNE experiment at Fermilab.'' The CCQE 
event in SciBooNE, shown in Fig. 5.10, makes clear the benefits of fine 
segmentation. The position of the vertex and the short track from the pro- 
ton are well-resolved in the SciBar detector (green region). The technology 
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has been taken further by the MINERvA experiment, which has attained 2 
mm resolution with their prototype.°? Scibar and MINERVA are relatively 
small (few ton) detectors. The first very large scale application of this tech- 
nology will be NOvA, which is a future 15 kton detector.*° This detector 
will use PVC tubes filled with liquid scintillator, which is more cost-effective 
than extruded scintillator strips for very large detectors. Their design also 
loops the wave-length shifting fiber, so that there are, effectively, two per- 
fectly mirrored fibers are in each cell. This elegant solution increases the 
collected light by a factor of four, which is necessary for ~ 15 m strips. 

The most promising new technology for high resolution track recon- 
struction in neutrino physics is the liquid argon TPC. A TPC, or time 
projection chamber, uses drift chambers to track in the x and y views and 
drift time to determine the z view. Liquid argon (LAr), which provides 
the massive target for the neutrino interaction, also scintillates, providing 
the start for the drift-time measurement. A key point for future neutrino 
experiments is the high efficiency for identifying electron showers (expected 
to be 80-90%) with a rejection factor of 70 for NC 7° events. In particular, 
these detectors can differentiate between converted photons and electrons 
through the dE/dx in the first few centimeters of the track. Typical energy 
resolution for an electromagnetic shower is 3% / VE. 

There is a great deal of activity on development of LAr detectors. Data 
have been taken successfully on a 50 liter LArTPC prototype in the NO- 
MAD neutrino beam at CERN, resulting in reconstruction of ~100 CC 
41 Also, recently, a 600 ton Icarus module has been 
commissioned at Gran Sasso.*? A 0.8 ton LAr test detector will begin tak- 
ing data at Fermilab in January, 2008.43 As discussed in Sec. 5.3.2, the 
microBooNE experiment is a proposed 100 ton detector which would take 
data in 2010.44 In principle, these detectors can be scaled up to tens of 
ktons, as is discussed in the “Ash River Proposal” .4° 


quasielastic events. 


5.2. Neutrinos As We Know Them Now 


The recent discovery of neutrino oscillations requires that we reconsider 
the Standard Model Lagrangian of Sec. 5.1.1. It must now incorporate, 
preferably in a motivated fashion, both neutrino mass and neutrino mix- 
ing. This represents both a challenge and an opportunity for the theory, 
which I will discuss in the following section. This section concentrates on 
the experimental discovery. It is interesting to note that while neither neu- 
trino mass nor mixing were “needed” in the Standard Model theory, both 
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are required for the discovery of neutrino oscillations. The probability for 
neutrino oscillations will be zero unless both effects are present. 

The outcome of the observation of neutrino oscillations is typically sum- 
marized by the statement that “neutrinos have mass.” To be clear: we still 
have no direct measurement of neutrino mass. At this point, we have clear 
evidence of mass differences between neutrinos from the observation of neu- 
trino oscillations. A mass difference between two neutrinos necessarily im- 
plies that at least one of the neutrinos has non-zero mass. All experimental 
evidence indicates that the actual values of the neutrino masses are tiny 
in comparison to the masses of the charged fermions. At the end of this 
section, attempts at direct measurement of neutrino mass are described. 


5.2.1. Neutrino Oscillations 


Recent results on neutrino oscillations provide indisputable evidence that 
there is a spectrum of masses for neutrinos. In this section, I describe the 
formalism for neutrino oscillations, and then review the experimental results 
which have now been confirmed at the 50 level. This is covered briefly 
because these results are well known and covered extensively elsewhere.” 


5.2.1.1. The Basic Formalism 


Neutrino oscillations requires that neutrinos have mass, that the difference 
between the masses be small, and that the mass eigenstates be different 
from the weak interaction eigenstates. In this case, the weak eigenstates 
can be written as mixtures of the mass eigenstates. For example, in a simple 
2-neutrino model: 


Ve = cos Ô 1, + sin 0 12 
Vy = — sin 0 v + cos 0 V2 


where @ is the “mixing angle.” In this case, a pure flavor (weak) eigenstate 
born through a weak decay can oscillate into another flavor as the state 
propagates in space. This oscillation is due to the fact that each of the 
mass eigenstate components propagates with different frequencies if the 
masses are different, Am? = |m} — m?| > 0. In such a two-component 
model, the oscillation probability for v, — Ve oscillations is then given by: 
2 2 

Prob (v, > ve) = sin? 29 sin? (eae) : (5.10) 
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Fig. 5.11. Example of neutrino oscillations as a function of distance from the source, L. 
The wavelength depends upon the experimental parameters L and E (neutrino energy) 
and the fundamental parameter Am?. The amplitude of the oscillation is constrained 
by the mixing term, sin? 26. 


where L is the distance from the source, and E is the neutrino energy. As 
shown in Fig. 5.11, the oscillation wavelength will depend upon L, E, and 
Am?. The amplitude will depend upon sin? 26. 

Neutrino oscillations only occur if the two mass states involved have 
sufficiently small Am? that the neutrino flavor is produced in a superposi- 
tion of two mass states. If the mass splitting is sufficiently large, a given 
neutrino flavor would be produced in one or the other of the two mass 
eigenstates and interference (i.e., oscillations) would not occur. 

Most neutrino oscillation analyses consider only two-generation mixing 
scenarios, but the more general case includes oscillations among all three 
neutrino species. This can be expressed as: 


Ve Ue Uez Ue vı 
Vu = Up Up2 Uy3 V2 
Vr Ur 1 Uz2 U73 V3 


This formalism is analogous to the quark sector, where strong and weak 
eigenstates are not identical and the resultant mixing is described conven- 
tionally by a unitary mixing matrix. The oscillation probability is then: 


Prob (va > vg)= dag — 
.27 Am?, 
å E UaU U: Up 33x? (Sau) , (5.11) 
j>i 
where Am;,, = m? —m, a and 8 are flavor-state indices (e, p, T) and i and 
j are mass-state indices (1, 2,3). 
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For three neutrino mass states, there are three different Am? parame- 
ters, although only two are independent since the two small Am? parame- 
ters must sum to the largest. The neutrino mass states, v1, v2 and v3 are 
defined such that the difference between vı and v2 always represents the 
smallest splitting. However, the mass of v3 relative to vı and v2 is arbitrary 
and so the sign of the Am? parameters which include the third mass state 
may be positive or negative. That is, if v3 > v1, V2, then Am3, will be pos- 
itive, but if 11,2 > v3, then Am3, will be negative. The former is called 
a “normal mass hierarchy” and the latter is the “inverted mass hierarchy.” 
At this point, the sign is irrelevant because Am? appears in a term which 
is squared. However, in Sec. 5.3.1, this point will become important. 

The mixing matrix above can be described in terms of three mixing 
angles, O12, 013 and 023: 


C1213 512C13 513 
U = | —s812€93 — C12$23513 C12C23 — 8128523813 $23€13 | , (5.12) 
512523 — C1223513 —C€12523 — 51223513 C2313 


where cij = cosĝ;j and s;; = sin@;;, with iand j referring to the mass 
states. In fits to the oscillation parameters, people variously quote the 
results in terms of the matrix element of U, sin-squared of the given angle, 
sin-squared of twice the angle and a variety of other forms, all of which 
are related. Using the 13 case as an example, the quoted parameters are 
related by: 


1 
Us x% sin? 013 y 7 sin? 2043. (5.13) 


Thus, in total, there are five free parameters in the simplest three- 
neutrino oscillation model, which can be taken to be Am7,, Am33, 012, 913 
and 023. 

Although in general there will be mixing among all three flavors of neu- 
trinos, two-generation mixing is often assumed for simplicity. If the mass 
scales are quite different (m3 >> m2 >> mı, for example), then the oscil- 
lation phenomena tend to decouple and the two-generation mixing model is 
a good approximation in limited regions. In this case, each transition can 
be described by a two-generation mixing equation. However, it is possi- 
ble that experimental results interpreted within the two-generation mixing 
formalism may indicate very different Am? scales with quite different ap- 
parent strengths for the same oscillation. This is because, as is evident from 
equation 5.11, multiple terms involving different mixing strengths and Am? 
values contribute to the transition probability for Va > vg. 
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5.2.1.2. Matter Effects 


The probability for neutrino oscillations is modified in the presence of mat- 
ter. This is true in any material, however the idea was first explored for 
neutrino oscillations in the Sun, by Mikheyev, Smirnov and Wolfenstein. 
Therefore, matter effects are often called “MSW” effects.4° In general, 
matter effects arise in neutrino-electron scattering. The electron neutrino 
flavor experiences both CC and NC elastic forward-scattering with elec- 
trons. However, the v, and v, experience only NC forward-scattering, be- 
cause creation of the u or T is kinematically forbidden or suppressed (e.g. 
Vu +e —+ve+p). This difference produces the matter effect. 

For neutrinos propagating through a constant density of electrons, if Ve 
is the elastic forward scattering potential for the Ve component, and Vother 
is the potential for the other neutrino flavors, then the additional scattering 
potential is 


V= Ve — Vother = V2GFne, (5.14) 


where Gp is the Fermi constant and ne is the electron density. This poten- 
tial modifies the Hamiltonian, so that, if Ho is the vacuum Hamiltonian, 
then in matter the Hamiltonian is Hy) +V. This means that the eigenstates 
are modified from those of a vacuum, vı and 12, to become Vim and V2. 
Effectively, the neutrino mass spectrum is not the same as in vacuum. The 
solutions to the Hamiltonian are also modified. From this, one can see that 
the presence of electrons may substantially change the oscillatory behavior 
of neutrinos. 

The simplest outcome is that matter induces a shift in the mass state, 
which is a combination of flavor eigenstates, propagates through the mate- 
rial. This leads to a change in the oscillation probability: 


Prob (ve > vu) = (sin? 20/W°) sin? (1.27W Am? L/E) (5.15) 


where W? = sin? 20 + (V2G pne(2E/ Am?) — cos 20)? (Note that in a vac- 
uum, where ne = 0, this reduces to equation 5.10.) From this, one can see 
that if a neutrino, passing through matter, encounters an optimal density 
of electrons, a “resonance,” or large enhancement of the oscillation proba- 
bility, can occur. The Sun has a wide range of electron densities and thus 
is a prime candidate for causing matter effects. Also, neutrinos traveling 
through the Earth’s core, which has a high electron density, might expe- 
rience matter effects. This will produce a “day-night effect,” or siderial 
variation, for neutrinos from the Sun. 
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For situations like the Sun, with very high electron densities which vary 
with the position of the neutrino (and hence the time which the neutrino 
has lived), the situation is complex. If the electron density is high and 
the density variation occurs slowly, or adiabatically, then transition (not 
oscillation!) between flavors in the mass state can occur as the neutrino 
propagates. Thus it is possible for neutrinos to be produced in the core 
of the Sun in a given mass and flavor state, and slowly evolve in flavor 
content until the neutrino exits the Sun, still in the same mass state. In 
other words, in the Sun, a Ve produced in a mass eigenstate V2m(r), which 
depends on the local electron density at radius r, propagates as a V2, until 
it reaches the r = Rsolar, where Vom(Rsolar) = v2. This peculiar effect is 
called the Large Mixing Angle MSW solution. 


5.2.1.3. Designing an Oscillation Experiment 


From equation 5.10, one can see that three important issues confront the 
designer of the ideal neutrino experiment. First, if one is searching for os- 
cillations in the very small Am? region, then large L/E must be chosen 
in order to enhance the sin?(1.27Am?L/E) term. However if L/E is too 
large in comparison to Am?, then oscillations occur rapidly. Because ex- 
periments have finite resolution on L and FE, and a spread in beam energies, 
the sin?(1.27Am?L/E) averages to 1/2 when Am? > L/E and one loses 
sensitivity to Am?. Finally, because the probability is directly proportional 
to sin? 26, if the mixing angle is small, then high statistics are required to 
observe an oscillation signal. 

There are two types of oscillation searches: “disappearance” and “ap- 
pearance.” To be simplistic, consider a pure source of neutrinos of type x. 
In a disappearance experiment, one looks for a deficit in the expected flux of 
Vr. This requires accurate knowledge of the flux, which is often difficult to 
predict from first principles. Therefore, most modern disappearance exper- 
iments employ a near-far detector design. The near detector measures the 
flux prior to oscillation (the design goal is to effectively locate it at L = 0 
in Fig. 5.11). This is then used to predict the unoscillated event rate in 
the far detector. A deficit compared to prediction indicates disappearance. 
Appearance experiments search for Va — vg by directly observing interac- 
tions of neutrinos of type 8. The case for oscillations is most persuasive if 
the deficit or excess has the (L/E) dependence predicted by the neutrino 
oscillation formula (equation 5.10). 

The “sensitivity” of an experiment is defined as the average expected 
limit if the experiment were performed many times with no true signal (only 
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background). Let us consider the sensitivity for a hypothetical perfect (no- 
systematic error) disappearance neutrino oscillation experiment with N 
events. A typical choice of confidence level is 90%, so in this case, the 
limiting probability, assuming there is no signal, is 


P=oVN/N. (5.16) 


There are two possible choices of o associated with a 90% CL sensitivity, 
depending on the underlying philosophy. If one assumes there is no signal 
in the data, then one quotes the sensitivity based on 90% of a single-sided 
Gaussian, which is ø = 1.28. If the philosophy is that there is a signal which 
is too small to measure, then one quotes the sensitivity using o = 1.64, 
which is appropriate for a double-sided Gaussian. Historically, 0 = 1.28 
was used in most publications. Physicists engage in arguments as to which 
is most correct, but what is most important from a practical point of view 
is for the reader to understand what was used. The reader can always scale 
between 1.28 and 1.64 depending on personal opinion. 
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Fig. 5.12. An illustration of the sensitivity of an imaginary oscillation experiment. The 
region of sensitivity for an experiment depends on the oscillation probability, P, where 
one can set a limit at some confidence level. Most experiments use 90% CL. The bound- 
aries depend on P, L and E. 
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There is only one measurement, P, and there are two unknowns, Am? 
and sin? 20; so this translates to a region of sensitivity within Am? — sin? 20 
space. This is typically indicated by a solid line, with the allowed region 
on the right on a plot (see illustration in Fig 5.12). For the perfect (no- 
systematic error) experiment, the high Am? limit on sin? 20 is driven by the 
statistics. On the other hand, the L and E of the experiment drive the low 
Am? limit, which depends on the fourth root of the statistics. If our perfect 
experiment had seen a signal, the indications of neutrino oscillations would 
appear as “allowed regions,” or shaded areas on plots of Am? vs. sin? 20. 

This rule of thumb — that statistics drives the sin? 20-reach and L/E 
drives the Am? reach — becomes more complicated when systematics are 
considered. The imperfections of a real experiment affect the limits which 
can be set. Systematic uncertainties in the efficiencies and backgrounds 
reduce the sensitivity of a given experiment. Background sources introduce 
multiple flavors of neutrinos in the beam. Misidentification of the inter- 
acting neutrino flavor in the detector can mimic oscillation signatures. In 
addition, systematic uncertainties in the relative acceptance versus distance 
and energy need to be understood and included in the analysis of the data. 

For a real experiment, with both statistical and systematic errors, find- 
ing the sensitivity and final limit or allowed region requires a fit to the 
data. The data are compared to the expectation for oscillation across the 
range of oscillation parameters, and the set of parameters where the agree- 
ment is good to 90% CL are chosen. Historically, there are three main 
approaches which have been used in fits. The first method is the “single 
sided raster scan.” In this case one chooses a Am? value and scans through 
the sin?26-space to find the 90% CL limit. The second method, the “global 
scan,” explores Am?- and sin?26-space simultaneously. Thus there are 
two parameters to fit and two degrees of freedom. The third method is 
the frequentist, or “Feldman-Cousins” approach,*”’ in which one simulates 
“fake-experiments” for each Am? and sin?20 point, and determines the 
limit where, in 90% of the cases, no signal is observed. Each method has 
pros and cons and the choice is something of a matter of taste. As with 
the question of a single- or double-sided gaussian, what is important is to 
compare sensitivities, limits, and signals from like methods. 

It is possible for an experiment which does not observe a signal to set 
a limit which is better than the sensitivity. This occurs if the experiment 
observed a downward fluctuation in the background. In this case, a limit 
is hard to interpret. The latest standard practice is to show the sensitivity 
and the limit on plots, and the readers can draw their own interpretation.*” 
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5.2.1.4. Experimental Evidence for Oscillations 


Two separate allowed regions in Am?-and-sin? 26 -space for neutrino oscil- 
lations have been observed at the > 5a level. These are called the “Atmo- 
spheric Am?” and “Solar Am?” regions. The names are historical, as will 
be seen below. Many reviews have been written on these results (see, for 
example,,”,4° and4’) and so here the results are briefly outlined. 

The highest Am? signal was first observed using neutrinos produced in 
the upper atmosphere. These atmospheric neutrinos are produced through 
collisions of cosmic rays with the atmosphere. The neutrinos are detected 
through their charged-current interactions in detectors on the Earth’s 
surface. 

The first evidence for atmospheric neutrino oscillations came from the 
Kamioka®’ and IMB*! experiments. This was followed by the convincing 
case presented by the Super K experiment.°? These were single detector 
experiments observing atmospheric neutrino interactions as a function of 
zenith angle (see Fig. 5.13). Several striking features were observed. The 
first was that the v,, flavor neutrinos showed clear evidence of disappearance 
while the ve flavor CC scatters were in good agreement with prediction. 
The second striking observation was that the apparent mixing was nearly 
maximal. In other words, the experiments were seeing a 50% reduction of 
the v, event rate compared to expectation. 

Complications in the analysis arise from the difficulty in understanding 
production of atmospheric neutrinos (affecting the understanding of EF) 
and in the accurate reconstruction of events as a function of zenith angle 
(affecting knowledge of L). The Am? extracted from the Kamoiokande 
data is an order of magnitude higher than that extracted from the Super 
K data, indicating a clear systematic effect. Thus, it was absolutely crucial 
for accelerator-based “long-baseline” neutrino experiments to confirm this 
result. In these experiments, the L is well defined by the distance from 
source to detector, and the Æ is well understood from a near detector 
measurement. 

The challenge for long-baseline experiments is that the L/E required to 
access the atmospheric signal is on the order of 1000 km/GeV. If the beam 
is relatively low energy, so that the easy-to-reconstruct CCQE interaction 
dominates the events, then L is on the order of 1000 km. This leads to 
two major technical challenges. First, because the Earth is a sphere, if the 
source and detector are to be located on (or near) the surface, the beam 
must be directed downward, into the Earth. Engineering a beamline at a 
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Fig. 5.13. Event rates observed in Super K as a function of zenith angle for two energy 
ranges. Candidate ve events are on the left, v, are on the right. The red line indicates 
the predicted rate. The green line is the best fit including oscillations. 5? 


steep angle requires overcoming substantial hurdles in tunneling. Second, 
the beam spreads as it travels outward from the source, resulting in low 
intensity at the detector. Therefore, very high rates are needed. However, 
these challenges have now been overcome at three accelerator complexes: 
KEK, FNAL and CERN, and a new long-baseline beam from the JPARC 
facility will be available soon. Making use of these lines, initial confirmation 
of the atmospheric neutrino deficit came from the KEK-to-Kamiokande 
(K2K) long baseline experiment.*+ This has since been followed up by the 
MINOS experiment to high precision.°°* 

In the atmospheric data, the ve CC signal is in agreement with expecta- 
tion, and in the long-baseline experiments, no Ve excess has been observed. 
Therefore, one cannot interpret this oscillation signal as v, — ve. This 
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leaves only v, — v, as an explanation for the deficit in a three-neutrino 
model. Observation of v, CC interactions is experimentally difficult in these 
experiments for a number of reasons. First, the L/E of the signal is such 
that for lengths available to present experiments, the energy of the beam 
must be low (< 10 GeV). As discussed in Sec. 5.1.2, because the 7 mass 
is 1.8 GeV, there is substantial mass suppression for 7 production at low 
energies, so the CC event rate is low. Second, the 7 decays quickly, leaving 
behind a complicated event structure which can be easily confused with v, 
and ve low multiplicity interactions in calorimeter or Cerenkov detectors. 
The difficulty of identifying v, events even in a specialized emulsion-based 
detector with a high energy neutrino beam, was made clear by the DoNuT 
experiment,® which provided the first, and so far only, direct observation 
of CC v, interactions. Thus, while some studies claim observation of vy 
CC interactions in SuperK,°” these results are not very convincing to this 
author. Fortunately, a specialized experiment called OPERA,°® which is 
an emulsion-based long-baseline detector, is presently taking data. The 
average energy of the CNGS beam used by this experiment is 17 GeV, 
sufficiently high to produce v, CC events. This experiment is expected to 
observe ~ 15 events in 5 years of running if the atmospheric neutrino deficit 
is due to v, > v, with Am? = 2.5 x 1073 eV?.*° 

The lower Am? signal is called the “Solar Neutrino Deficit,” as it was 
first observed as a low rate of observed ve’s from the Sun. The first obser- 
vation of this effect was a Ve deficit observed using a Cl target! by Ray 
Davis and collaborators at Homestake, using Ve + Cl > e+ Ar. Only about 
1/3 of the total expected neutrino event rate was observed. By 1999, four 
additional experiments had confirmed these observations. The GALLEX!® 
and SAGEP experiments confirmed a deficit for CC electron neutrino inter- 
actions in a Ga target producing Ge. The Super Kamiokande experiment 
observed a deficit for ve + e > ve + e reactions in water.?? The deficit is 
shown on Fig. 5.14, indicated by the blue points. This plot shows the ratio 
to the Standard Solar Model prediction, which is indicated by the solid line 
at unity. 

A few aspects of the initial solar neutrino deficit studies should be noted. 
First, the three types of experiments, chlorine-based, gallium-based, and 
water-based, measured different levels of deficit. Given that each type of 
nucleus has a different low energy threshold for observation of CC events, 
as previously discussed, one can interpret the varying levels of deficit as an 
energy dependent effect. Second, all of the above experiments rely upon 
the CC interaction. The energy of neutrinos from the Sun is so low, that 
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Fig. 5.14. Ratio of observed event rates in solar neutrino experiments compared to the 
Standard Solar Model. Experiments are plotted at the average energy of the detected 
signal, which varies due to detection threshold. Black error bars indicate Standard Solar 
Model error.?3 


should vy, or v, be produced through oscillations, the CC interaction could 
not occur. This is because of the relatively high mass of the u (106 MeV) 
and the 7 (1.8 GeV). Thus all of these experiments can observe that ves 
disappeared, but they cannot observe if the neutrinos reappear as one of 
the other flavors. This makes a decisive statement that the effect is due to 
neutrino oscillations problematic. 

For some time, people argued the apparent deficit was due to an in- 
complete picture of solar processes. The two important theoretical issues 
related to the solar neutrino fluxes were the fusion cross sections and the 
temperature of the solar interior. A comprehensive analysis of the available 
information on nuclear fusion cross sections important to solar processes 
has been compiled®? and shows that the important cross sections are well- 
known. Results in helioseismology provided an important further test of the 
“Standard Solar Model”.®! The Sun is a resonant cavity, with oscillation 
frequencies dependent upon P/p, the ratio of pressure to density. Helioseis- 
mological data confirmed the SSM prediction of U to better than 0.1%.°? 
With the results of these studies, most physicists were convinced that the 
Standard Solar Model was substantially correct. The error bars on the 
black line at unity in Fig. 5.14 shows the side of the estimated systematic 
error on the Standard Solar Model. 
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Interpreting the results as neutrino oscillations resulted in a complicated 
picture. The vacuum oscillation probability, calculated using equation 5.10, 
results in allowed regions of Am? which are very low (Am? ~ 107!eV”). 
This is because the energy of the neutrinos is only a few MeV, and the Sun 
to the Earth pathlength is very long (~ 10''m) . On the other hand, the 
Sun has high electron content and density, so matter effects (Sec. 5.2.1.2) 
could interfere with the picture, allowing higher true values of Am?. The 
MSW effect yielded two solutions in fits to the data. One was at mixing 
angles of ~ 1073. Until very recently, this was regarded as the most likely 
solution based on analogy with mixing in the quark sector. The other 
solution gave a very large, although not maximal, mixing angle. 

Two dramatic results of the early 2000’s demonstrated that the solar 
neutrino deficit was due to oscillations with the MSW effect and with large 
mixing angle. The first result was from the SNO experiment.’ SNO used 
a D20 target which allowed for measurement of both CC ve interactions as 
well as v +d —v +n +p. In the former measurement, SNO sees a deficit 
consistent with the other measurements within an oscillation interpretation, 
and which yields a ve flux of (1.76 + 0.05(stat) + 0.09(sys)) x 106 /cm?s.64 
The later measurement is an NC interaction, and thus is flavor-blind. It 
yields a total NC flux of (5.099-44 (stat) 1045 (sys)) x 106/cm?s®* which can 
be compared with the theoretical prediction of (5.69 + 0.91) x 10°/cm?s.® 
In other words, SNO observed the expected total event rate, within errors. 
This implied that the ves are oscillating to neutrinos which participate in 
the NC interaction, v,s and/or vrs, with the total v, + v+ flux equal to 
(3.41 + 0.45(stat) 7048 (sys)) x 10®/cm?s.® The results of two runs of the 
SNO experiment are shown by the red and green points on Fig. 5.14. The 
second result was from the KamLAND experiment. This was a reactor- 
based experiment located in Japan. Using many reactors which were hun- 
dreds of kilometers away, the KamLAND experiment was able to reach 
L/E ~ 107 m/MeV. This covered the MSW allowed-Am? solution. The 
statistics were on the order of hundreds of events, but this was enough 
to probe the large mixing-angle MSW solution. KamLAND expected 365 
events and observed 258 events, and thus had clear evidence for oscilla- 
tions with large mixing, tan? 0 = 0.40+%010 and relatively high Am?, of 
7.9408 x 1075 eV?.°6 The energy distribution of the events observed in 
KamLAND is shown in Fig. 5.15. 

Based on the atmospheric and solar studies, there are two squared mass 
differences: Am?,,. and Am? The smaller is identified with the mass 


solar atmos* 
splitting between vı and v2: Am?, = Am2,),,.. The atmospheric deficit 
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Fig. 5.15. Events in KamLAND as a function of energy. The grey line indicates the 
expectation for no oscillation.® 


measures a combination of Am33; and Am?;. However, since Amj3 = 
Am}, + Am2, and Am?, is small, Am?, ~ Am3, = AMŽimos: 

A recent global analysis of the data®” from the above experiments yields 
a consistent picture for three neutrino oscillations with five free parameters. 
The mass differences are: Am?. = (7.9 + 0.3) x 1075eV? and |Am?3| = 
(2.57332) x 10-5eV, where the absolute value is indicated in the second 
case because the sign (i.e. the mass hierarchy) is unknown. The two well- 
measured mixing angles are determined to be: sin? 6j. = 0.307033 and 
sin? 053 = 0.507938. One mixing angle, 013 is yet to be measured, but a 
limit of sin? 6,3 < 0.025 can be placed based on global fits. 

Based on the measurements, the mixing matrix of Eq. 5.12, translates 
roughly into: 


0.8 05 ? 
U={04 06 07). (5.17) 
0.4 06 0.7 


This matrix, with its large off-diagonal components, looks very different 
from the quark-sector mixing matrix where the off-diagonal elements are 
all relatively small. In this matrix, the “odd element out” is U.3 which 
is clearly substantially smaller than the others. At this point, there is no 
consensus on what this matrix may be telling us about the larger theory, but 
there is a sense that the value of 013 is an important clue. Theories which 
attempt to explain this matrix tend to fall into two classes — those where 
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Table 5.3. Selected predictions for sin? 2013.104 


Model(s) Refs. approximate sin? 2013 
Minimal SO(10) 68 0.13 
Orbifold SO(10) 69 0.04 
SO(10) + Flavor symmetry 0 1.2- 10-6 
ig 7.8: 10-4 

TRTA 0.01 .. 0.04 

oat 0.09 .. 0.18 

SO(10) + Texture T8 4.1074 .. 0.01 
12 0.04 

SU(2), x SU(2)r x SU(4)e 80 0.09 
Flavor symmetries 81-83 0 
84,85,94 < 0.004 

eS? 10-4 .. 0.02 

90-24 0.04 .. 0.15 

Textures g5 4.107%.. 0.01 
20789 0.03 .. 0.15 

3 x 2 see-saw TOU 0.04 
Anarchy TOL > 0.04 
Renormalization group enhancement T02 0.03 .. 0.04 
M-Theory model 103 1074 


013 is just below the present limit and those with very small values. As an 
illustration of this point, Table 5.3 shows order of magnitude predictions 
for a variety of theories. Thus a measurement of sin? 2613 which is greater 
than about 1%, or a limit at this level, can point the way to the larger 
theory. 

The best method for measuring 613 is from reactor experiments which 
constrain this oscillation by searching for De disappearance. The oscillation 
probability is given by: 


Preactor ~ sin? 2613 sin? A+ a? A? cos* 013 sin? 20192, (5.18) 

with 
a = Am3,/Am3; (5.19) 
A= Am, L/(4E,). (5.20) 


Events are detected through the inverse beta decay (IBD) interaction. The 
CHOOZ experiment,”° with a baseline of 1.1 km and typical neutrino event 
energies between 3 and 5 MeV (E) = 3.5 MeV) has set the best reactor- 
based limit to date, of sin? 2013 < 0.27 at Am? = 2.5 x 1073 eV?. This 
limit can be improved with a global fit, as quoted above. 

Significant improvement is expected from the upcoming round of reactor 
experiments results due to introducing a near-far detector design. The near 
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detector measures the unoscillated event rate, and the far detector is used to 
search for a deficit as a function of energy. The Double Chooz experiment, 
beginning in 2009, is expected to reach sin? 26,3 ~ 0.03.!° This will be 
followed by the Daya Bay experiment which will reach ~0.01.1°6 


5.2.2. Direct Measurements of Neutrino Mass 


For neutrinos, there are no mass measurements, only mass limits. Observa- 
tions of neutrino oscillations are sensitive to the mass differences between 
neutrinos, not the actual mass of the neutrino. Therefore, they do not fall 
into the category of a “direct measurement”. One can, however, use these 
oscillation results to estimate the required sensitivity for a direct mass mea- 
surement. The upper limit comes from assuming that one of the neutrino 
masses is exactly zero. Given that the largest Am? is ~ 3 x 1078 eV?, this 
implies there is a neutrino with mass V Am? ~ 0.05 eV. The mass of the 
neutrino can be directly measured from decay kinematics and from time of 
flight from supernovae. Neither method has reached the 0.05 eV range yet, 
although the next generation of decay-based experiments comes close. 

We know from neutrino oscillations that there is a very poor correspon- 
dence between neutrino flavors and neutrino masses, i.e., the mixings are 
large. However, it is easiest to conduct the discussion of these limits in 
terms of specific flavors. Thus, what is actually being studied is an average 
mass associated with each flavor. For example, for the ve mass measured 
from 8 decay, which will be expanded upon below, what is actually probed 


1s: 


The simplest method for measuring neutrino mass is applied to the v,. 
The mass is obtained from the 2-body decay-at-rest kinematics of 7 > vy- 
One begins in the center of mass with the 4-vector relationship: pr = 


Pu + py. Squaring and solving for neutrino mass gives: m? = mz + mî, — 


4m2(|p,,|? + mz). From this, one can see that this technique requires 
accurate measurement of the muon momentum, p,,, as well as the masses 
of the muon, m, and the pion, my. In fact, the uncertainty on the mass of 
the pion is what dominates the v, mass measurement. As a result, a limit 
is set at my, < 170 keV.107108 
The mass for the v, is obtained from the kinematics of r decays. The 
T typically decays to many hadrons. However, the four vectors for each of 
the hadrons can be summed. Then the decay can be treated as a two-body 
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Table 5.4. Overview of ve squared mass measurements. 


Experiment measured m? (eV?) | limit (eV), 95% C.L. | Year 
Mainz 0 -0.6 2.2 2.1 2.2 2004 
Troitsk!!! -1.0 + 3.0+ 2.1 2.5 2000 
Mainz!!2 -3.7 + 5.3 4 2.1 2.8 2000 
LLNL!3 - 130 + 20+ 15 7.0 1995 
CIAE!!4 - 31 + 75+ 48 12.4 1995 
Zurich!!5 -24 + 48+ 61 11.7 1992 

Tokyo INS!16 - 65 + 85+ 65 13.1 1991 
Los Alamos!!? - 147 + 68+ 41 9.3 1991 


problem with the neutrino as one 4-vector and the sum of the hadrons as 
the other vector. At this point, the same method described for the v, can 
be applied. Measurements are again error-limited, so a limit on the mass 
is placed. The best limit, which is m,, < 18.2 MeV, comes from fits to 
T > 2r rtv, and T~ — 327 2r+(n?)v, decays observed by the ALEPH 
experiment. 109 

The experimental situation for the ve mass measurement is more com- 
plicated. The endpoint of the electron energy spectrum from tritium (@ 
decay is used to determine the mass. Just as in the case of the v, and vr, 
the experiments measure a value of m?. The problem is that the measure- 
ments have been systematically negative. A review of measurements, as a 
function of time, is given in Table 5.4. Recent measurements at Troitsk!!! 
and Mainz!!? 


ticle Data Group prescription for setting limit in the case of an unphysical 


are negative, but in agreement with zero. Following the Par- 


results, m? = 0 is assumed, with the quoted errors. Based on these results, 
one can extract a limit of approximately < 2 eV for the mass of the ve. 

The next big step in the measurement of neutrino mass from decay kine- 
matics will come from the Katrin Experiment.'!® Katrin will use tritium 
beta decay to measure the mass of the neutrino to 0.2 eV. This does not 
reach the range of 0.05 eV, which our simplistic argument presented at the 
top of this section indicated. However, that argument assumed that the 
lightest neutrino had zero mass. A small offset from zero easily boosts the 
spectrum into the range observable by Katrin. On the other hand, Katrin 
is sensitive to only electron flavor. Thus, its sensitivity depends up on the 
amount of mixing of Ve within the heaviest neutrino. 

Another method for measuring neutrino mass from simple kinematics is 
to use time of flight for neutrinos from supernovae. Neutrinos carry away 
~ 99% of the energy from a supernova. The mass limit is obtained from 
the spread in the propagation times of the neutrinos. The propagation time 
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for a single neutrino is given by 


2 


m 
tobs — temit = 1 =) 22 
bs — temit to( + 5p (5.22) 


where to is the time required for light to reach Earth from the supernova. 
Because the neutrinos escape from a supernova before the photons, we do 
not know temiz. But we can obtain the time difference between 2 events: 


2 
Atons — Atemit © (Ee — - (5.23) 
using the assumption that all neutrinos are emitted at the same time, one 
can obtain a mass limit of ~ 30 eV from the ~ 20 events observed from 
SN1987a at 2 sites.119120 

This is actually an oversimplified argument. The models for neutrino 
emission are actually quite complicated. The pulse of neutrinos has a 
prompt peak followed by a broader secondary peak with a long tail dis- 
tributed over an interval which can be 4 s or more. The prompt peak is 
from “neutronization” and is mainly ve, while all three neutrino flavors pop- 
ulate the secondary peak. However, the rate of Ve escape is slower compared 
to v, and v, produced at the same time, because the ves can experience 
CC interactions, while the kinematic suppression from the charged lepton 
mass prevents this for the other flavors. However, when all of the aspects 
of the modeling are put together, the bottom line remains the same: it will 
be possible to set stringent mass limits if we observe neutrinos from nearby 
supernovae. 

Some argue that cosmology provides a “direct measurement.” Cosmo- 
logical fits have sensitivity to neutrino masses, but the results are dependent 


on the cosmological parameters!*+ 


and the model for relic neutrino produc- 
tion. There are many examples of models with low relic neutrino densities 
which would significantly change the present interpretation of the cosmo- 
logical data.1?? In the opinion of the author, until these issues are settled, 
cosmological measurements cannot convincingly compete with kinematic 


decays and supernova measurements, despite aggressive claims. 


5.3. Neutrinos We Would Like to Meet 


Now that we know that neutrinos have mass, and thus are outside of ex- 
pectations, the obvious question is: “what other Beyond Standard Model 
properties do they possess?” The APS Study on the Future of Neutrino 
Physics focussed on this question. The plan for attack was divided into 
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three fronts: (1) Neutrinos and the New Paradigm, (2) Neutrinos and the 
Unexpected and 3) Neutrinos and the Cosmos. The remainder of this paper 
follows this structure. 

The consequences of the discovery of neutrino mass leads to a rich array 
of ideas. It is was beyond the scope of these lectures to cover the entire 
spectrum. So, in each of the three areas, two topics are chosen for extensive 
discussion. The reader is referred to the study’ and the accompanying 
theory white paper? for further ideas. 


5.3.1. Neutrinos and the New Paradigm 


The first step in creating a “New Standard Model” is to incorporate neu- 
trino mass. The simplest method is to introduce a Dirac mass, by analogy 
with the electron. This allows us to introduce a small neutrino mass, simply 
by arguing that the coupling to the Higgs is remarkably small. However, 
the unlikely smallness of the coupling has pushed theorists to look for other 
approaches. Among the oldest of these ideas is that neutrinos may be “Ma- 
jorana particles,” i.e., they are their own antiparticle. This leads to a new 
type of mass term in the Lagrangian. Through the “see-saw” mechanism, 
which fits well with Grand Unified Theories, this can also give a motivation 
for the apparently small value of the neutrino masses. 

A direct consequence of the Majorana See-Saw Model is a heavy neu- 
trino, with mass near the GUT scale. Because the heavy neutrino gets its 
mass through the Majorana rather than Dirac term of the Lagrangian, this 
neutrino was massive during the earliest periods of the universe, before the 
electroweak phase transition. The decays of such a heavy lepton could be 
CP violating. This would provide a mechanism for producing the observed 
matter-antimatter imbalance seen today. 

The tidiness of the the above theoretical ideas has caused this paradigm 
to emerge as the consensus favorite for the “New Standard Model.” How- 
ever, there is absolutely no experimental evidence for this theory at this 
time. We have no evidence for the Majorana nature of neutrinos. Nor do 
we have any evidence for CP violation in the neutrino system. The great 
challenge of the next few years, then, is to find any sign at all that this 
theory is correct. 

This section reviews how one introduces mass into the Lagrangian. The 
search for evidence of the Majorana nature of neutrinos though neutrinoless 
double beta decay is considered. Then, the prospects for finding evidence 
for CP violation is considered. 
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5.3.1.1. How Neutrinos Might Get Their Mass 


The simplest assumption is that the neutrino mass should appear in the 
Lagrangian in the same way as for the charged fermions — via a Dirac mass 
term. In general, the Dirac mass term in the Lagrangian will be of the form 


mdvrpr+vrwvz). (5.24) 


From the arguments presented in eqs. 5.1 through 5.6, we saw that the 
scalar “mass” term mixes the RH and LH states of the fermion. If the 
fermion has only one chirality, then the Dirac mass term will automatically 
vanish. For this reason, a standard Dirac mass term for the neutrino will 
require the RH neutrino and LH antineutrino states. 

To motivate the mass term, the most straightforward approach is to 
use the Higgs mechanism, as was done for the electron in the Standard 
Model. In the case of the electron, when we introduce a spin-0 Higgs 
doublet ,(h°, h+), into the Lagrangian, we find terms like: 


Jelen (Wr, (ht)t + We, (h?) + h.c., (5.25) 


where ge is the coupling constant and “h.c.” is the Hermetian conjugate. 
The piece of this term proportional to We,We,(h°)', combined with its 
Hermetian conjugate, can be identified with the Dirac mass term, metewe. 
We set (h?) = v/V2, so that we obtain g(h°)y We and me = gev/V2. 
This is the Standard Model method for conveniently converting the ad hoc 
electron mass, me, into an ad hoc coupling to the Higgs, ge and a vacuum 
expectation value (VEV) for the Higgs, v. Following the same procedure 
for neutrinos allows us to identify the Dirac mass term with my = g,v/ J2. 
The VEV, v, has to be the same as for all other leptons. Therefore, the 
small mass must come from a very small coupling, g,. This implies that 
Je > 5 X 104gr. 

There are several troublesome features to this procedure. The first issue 
which is often raised is: 


e Why would the Higgs coupling vary across eleven orders of magni- 
tude (the approximate ratio of the neutrino mass to the top quark 
mass)? 


In fact, this question is rather odd. Disregarding the neutrinos, the masses 
of the charged fermions varies across six orders of magnitude (from the 
electron mass to the top mass). If six orders of magnitude do not bother 
anyone, why should eleven? Turning this around, if the Higgs couplings 
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alreay seemed stretched in the charged fermion case, the neutrinos stretch 
the argument much further. This leads to the second troublesome issue, 


e Physically, what is occurring? 


The Higgs mechanism really gives little physical insight. While it does 
introduce mass, it has simply shifted the arbitrariness of the magnitude of 
the mass into an arbitrary coupling to a new field. 

These two questions have led theorists to look at other explanations 
for small neutrino mass. It has been noted that neutrinos have the unique 
feature of carrying no electric or strong charge. Thus, neutrinos, alone 
among the Standard Model fermions, may be their own antiparticle, i.e. 
they may be Majorana particles. The nice consequence of this is a somewhat 
more motivated theory of mass for neutrinos. 

To understand this, first consider what is meant to be a Dirac versus 
a Majorana particle. If neutrinos are Dirac particles, then the v and the 
D are distinct particles, just as the electron and positron are distinct. The 
particle, v has lepton number +1 and the antiparticle, 7 has lepton number 
—1. Lepton number is conserved in an interaction. Thus, using the muon 
family as an example, vs (L = +1) must produce w~ (L = +1) and vs 
(L = —1) must produce u* (L = —1). The alternative viewpoint is that 
the v and P are two helicity states of the same “Majorana” particle, which 
we can call “v3,” The m+ decay produces the left-handed v™” and the 
ma decay produces the right-handed v™*/. This model explains all of the 
data without invoking lepton number and has the nice feature of economy of 
total particles and quantum numbers, but it renders the neutrino different 
from all other Standard Model fermions. 

Saying that the neutrino is its own antiparticle is equivalent to saying 
that the neutrino is its own charge conjugate, Y = w. The operators 
which appear in the Lagrangian for the neutrino in this case are the set 
(dr, Ur, VE, YR) and (br, YR, YL, Wp). Certain bilinear combinations of 
these in the Lagrangian can be identified as Dirac masses (i.e. m(pr Yr + 
...)). However, we also get a set of terms of the form: 


(Mr/2)(%L VL) + (Mr/2)(br'br) +++: (5.26) 


These are the “Majorana mass terms,” which mix the pair of charge- 
conjugate states of the fermion. If the particle is not its own charge conju- 
gate, then these terms automatically vanish and we are left with only the 
Dirac terms. Dirac particles have no Majorana mass terms, but Majorana 
particles will have Dirac mass terms. 
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The mass terms of the Lagrangian can be written in matrix form: 


(1/2)(0 dr) 4 A és + he, (5.27) 

The Dirac mass, m, is on the off-diagonal elements, while the Majorana 

mass constants, Mz, Mp are on the diagonal. To obtain the physical 
masses, one diagonalizes the matrix. 

One can now invoke “see-saw models” which motivate small observable 

neutrino masses. It turns out that GUT’s motivate mass matrices that look 


like!?4; 
0 My 
(0m), a 


with my < M. When you diagonalize this matrix to obtain the physical 
masses, this results in two states which can be measured experimentally: 


Milight ~ m2 /M, (5.29) 
Mheavy © M (5.30) 


Grand Unified Theories favor very large masses for the “heavy neutrino” 
(often called a “neutral heavy lepton” ). It is argued that it is most “natural” 
to have M be at the GUT scale. If M ~ 107° eV, and Might < 1 eV, 
as observed, then m, ~ 101? eV, or is at the TeV scale. This is rather 
high compared to masses of other leptons, but not so far beyond the top 
quark mass to regard the connection as crazy. So while some arbitrariness 
remains in this model, nevertheless there is a general feeling in the theory 
community that this is an improvement. 

In this theory neutrinos have only approximate handedness, where the 
light neutrino is mostly LH with a very small admixture of RH and the 
neutral heavy lepton is essentially RH. Thus we have a LH neutrino which 
is light, which matches observations, and a RH neutrino which is not yet 
observed because it is far too massive. 


5.3.1.2. Majorana vs. Dirac? 


How can we experimentally tell the difference between the Dirac (v, 7) and 
Majorana (v™%) scenarios? One can imagine a straight-forward thought 
experiment. First, produce left-handed neutrinos in 7* decays. These may 
be vs or they may be v7"4/s. Next, run the neutrino through a magic 
helicity-flipping device. If the neutrinos are Majorana, then what comes 


maj 


out of the flipping-device will be vp;;. These particles will behave like 
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antineutrinos when they interact, showing the expected RH y-dependence 
for the cross section. But if the initial neutrino beam is Dirac, then what 
comes out of the flipping-device will be right-handed vs, which are sterile. 
They do not interact at all. Such a helicity-flipping experiment is presently 
essentially impossible to implement. If neutrinos do have mass, then they 
may have an extremely tiny magnetic moment and a very intense magnetic 
field could flip their helicity. But the design requirements of such an ex- 
periment are far beyond our capability at the moment. Therefore, at the 
moment, we do not know if neutrinos are Majorana or Dirac in nature. 

Instead, experimentalists are pursing a different route. The Majorana 
nature of the neutrino can lead to an effect called neutrinoless double 8 
decay: (Z, A) => (Z+ 2, A)+ (e7 e7). This is a beyond-the-Standard Model 
analogue to double 8 decay: (Z, A) —> (Z + 2, A) + (e7 e7 DeDe). Double 8 
decay is a standard nuclear decay process with a very low rate because there 
is a suppression proportional to (Gp cos@c)*. Therefore, in most cases, if 
the weak decay is possible, single 8 decay ((Z, A) — (Z + 1, A) +e7 + De) 
will dominate. However, there are 13 nuclei, including ’®Xe — '6Ba 
and Ge —"6 Se, for which single 8 decay is energetically disallowed. In 
these cases double 8 decay with two neutrinos has been observed.!?? If 
the neutrino were its own antiparticle, then the neutrinos produced in the 
double 8 decay process could annihilate, yielding neutrinoless double 8 
decay. 

If there are Majorana neutrinos, then the amplitude for 0v 88 is propor- 
tional to the square of 


moves = > Um. (5.31) 


This should be contrasted with Eq. 5.21. The 0v{@ searches are probing 
different effective masses than the direct searches and the two results yield 
complementary information. Like the direct searches, the possibility of 
seeing 0v 3G depends on the amount of electron-flavor mixed in the most 
massive neutrino state. If this is small, then the rate of decay will be very 
low. Thus the hierarchy of the neutrino states affects our ability to observe 
0v6. To completely untangle the Dirac vs. Majorana question, three 
different experiments — direct mass measurement, hierarchy measurement 
and 0vG measurement — may be required.” 

Extracting mo,88 from a measured half-life leads to a theoretical error 
from the nuclear matrix element calculations. A favored style of calculation 
uses the “QRPA” (Quasiparticle Random Phase Approximation)!2° 178 
model. Using !°°Mo as an example, different matrix elements from QRPA 
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Fig. 5.16. Spectrum for two-neutrino double 8 decay and expected peak for neutrinoless 
double 8 decay. 


calculations cause mo,8g to vary by up to 2 eV for a half-life of 4.5 x 1073 
years./?9 So the error is significant. 

The 0vGG events must be separated from the standard two-neutrino 
double 8 (2vG8) decay background. This can be done through simple 
kinematics cuts. The two-body nature of 0v 68 decay will cause a peak at 
the endpoint of the 2v36 decay (4-body) spectrum, as shown in Fig. 5.16. 
An advantage of observing 21,33, however, is that measurement of its half- 
life allows direct measurement of the matrix element. At this point the 
2v3 decay spectrum has been observed in 10 elements. In some cases, 
such as !°°Mo, the the 2v half-life is well measured and can be used 
to constrain nuclear matrix element calculation. For this case, NEMO-3 
reports a half life of (7.68 + 0.02(stat) + 0.54(sys)) x 1018 y.13° 

At present, no signal for 0v8 decay has been clearly observed. The 
present 90% CL limit on the lifetime from CUORICINO on !°°Te is 
1.8 x 1074 years, corresponding to limit of Moves < 0.2 — 1.1 eV.'*' The 
NEMO-3 experiment has set 90% CL limits of 4.6 x 10% and 1.0 x 1073 
on '°°Mo and ®Se, respectively.!?®? The corresponding limits on mova 
are 0.7-2.8 eV and 1.7-4.9 eV.!?° There is a candidate signal observed at 
4.2g from a Germanium detector,'*? although the statistical significance is 
under debate.? The measured half-life was 1.19 x 10?5 years. Until this 
result is confirmed by further experiments, it is best to reserve judgment. 

Luckily, a range of future 0vG@ decay experiments are on the horizon. 
These are expected to probe an order of magnitude further in lifetimes. 
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Fig. 5.17. Example of two diagrams for Neutral Heavy Lepton decay which can interfere 
to produce CP violation. 


t,133 will turn on 


In particular, the germanium-based GERDA experimen 
soon and will address the existence of the possible signal. CUORE,!*4 
SuperNEMO,!° EXO,!36 Majorana!?’ and Moon?’ will extend the search 
even further using a wide range of elements. The reach of these near future 
Ov 2G covers the prediction for the inverted mass hierarchy. 


5.3.1.3. CP Violation in the Neutrino Sector 


An intriguing aspect of the “new Standard Model” is the heavy GUT-scale 
neutrinos which gain mass through the Majorana terms in the Lagrangian. 
There could be more than one, and likely, given the trend in the Standard 
Model, there would be three, so we can label these N1, N2 and N3. These 
heavy neutrinos have mass prior to the electroweak phase transition in 
which the Dirac terms appear. As a result, prior to the electroweak phase 
transition, decays shown in Fig. 5.17 are possible. Both decays produce the 
same final state, Nı — L/H, where £ and H are oppositely charged. These 
diagrams interfere, and can lead to a different decay rate to €~ and F, 
which is CP violation. 

This form of CP violation would lead to a lepton asymmetry in the early 
universe which could be transferred into a baryon asymmetry. A mechanism 
for this already appears in the Standard Model, in which but B, baryon 
number, and L, lepton number are not conserved, but the difference, B—L, 
is exactly conserved. B and L violation occurs in transitions between vac- 
uum states at high energies, called the “sphaleron process.” Variations on 
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this mechanism, called “leptogenesis,” may explain the matter-antimatter 
asymmetry we see today. 

Nı, N2, and N3 are far too massive to be produced at accelerators in 
the near future. Thus observing CP violation in their decays is out of 
the question. However, observing CP violation in the light neutrino sector 
would be a plausible hint that the theory is correct. 

To incorporate CP violation into the three-light-neutrino model, the 
leptonic mixing matrix is expanded and written as: U¥"" CP = VK. In 
this case, V is very similar to the U of Eq. 5.12, but with a CP violating 
phase, 6: 

id 


€12C13 §12C13 $13€ | 
V = | —812€93 — C12823813€"? C1223 — 8125238136? $23€13 
$12523 — €12€23813e" —C€1 2823 — $12€93813e €23C13 
(5.32) 
This is analogous to the CKM matrix of the quark sector. The other term, 
K = diag (1, ec’, ei(¢2+9)) (5.33) 


has two further CP violating phases, ¢; and ¢. 

Now, we potentially have three non-zero CP violating parameters in 
the light neutrino sector, 6, ¢, and ¢2, as well as one or more CP violating 
parameters in the heavy neutrino sector, where the number depends upon 
the total number of N. In the Lagrangian, these all come from a matrix 
of Yukawa coupling constants. In principle, all of these phases can take on 
the full range of values, including exactly zero. However, it is difficult to 
motivate a theory in which some are nonzero and some are exactly zero. It 
is expected that these parameters will either all have non-zero values or all 
be precisely zero. If the latter case, then the difference between the lepton 
sector, with no CP violation, and quark sector, with clear CP violation, 
must be motivated. As a result, observation of CP violation in the light 
neutrino sector is regarded as the “smoking gun” to CP violation in the 
heavy sector. 

Returning to the light neutrino sector, how can the CP phases be mea- 
sured? The ¢ phases arise as a direct consequence of the Majorana nature 
of neutrinos. Therefore, in principle, the the @ phase associated with the 
electron family is accessible in neutrinoless double beta decay. In practice, 
this will be extremely hard to measure because this term manifests itself 
as a change in the sum in Eq. 5.31, which is proportional to the 0vG@ 
decay amplitude. Thus one seeks to measure a deviation of the (as-yet- 
unmeasured) 0vGG lifetime from the prediction which depends upon the 
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mixing angles (with relatively large errors at present), the (unknown) neu- 
trino masses, and the (poorly known) nuclear matrix element. Even if the 
effect is large, observation of the effect is clearly hopeless in the near future. 
On the other hand, 6, the “Dirac” CP violating term in V may be accessible 
though oscillation searches. 

CP violation searches involve observing a difference in oscillation prob- 
ability for neutrinos and antineutrinos. Only appearance experiments can 
observe CP violation. A difference between oscillations of neutrinos and 
antineutrinos in disappearance searches is CPT violating. In oscillation ap- 
pearance searches, the K matrix does not affect the oscillation probability 
because this diagonal matrix is multiplied by its complex conjugate. On the 
other hand, non-zero 6 can be observed. To test for non-zero 6, the oscilla- 
tion probability must depend upon the Ue3 component of Eq. 5.32. In other 
words, the search needs to involve transitions from or to electron flavor and 
involve the mass state v3. This combination of requirements — appearance 
signal, electron flavor involvement, and v3 mass state involvement — leads 
to one experimental option at present: comparison of V, — Ve to Vy > Ve 
at the atmospheric Am?, which is Am?,. The oscillation probability is 
given by: 


. 2 + 2 2 
Priong—baseline + sin 2613 sin 023 sin” A 


F a sin 2613 sindcp cos 613 sin 2812 sin 2093 sin? A 


+ a sin 2613 cosdcp cos 643 sin 2012 sin 2623 cos A sin? A 


+ a? cos? 693 sin? 2012 sin? A, (5.34) 


where a and A are defined in Eq. 5.20. The second term is negative for 
neutrino scattering and positive for antineutrino scattering. 
Unfortunately, Eq. 5.34 convolutes two unknown parameters, the sign 
of Am?, (the mass hierarchy) and the value of 613, with the parameter, 
6, that we want to measure. The problem of the mass hierarchy can be 
mitigated by the experimental design. The sign of Am? = m3— m? affects 
the terms where Am? is not squared. These terms arise from matter effects 
and can so be reduced if the pathlength in matter is relatively short. For 
long baseline experiments, which must shoot the beam through the Earth, 
this means that L must be relatively short. In order to retain the same L/E 
and, hence, the same sensitivity to Am?,, E must be comparably reduced. 
On the other hand, the problem of 013 cannot be mitigated. From Eq. 5.32, 
one sees that we are in the unfortunate situation of having the CP violating 
term multiplied by sin 013. The smaller this factor, the harder it will be to 
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extract ô. If sin? 2013 is smaller than ~ 0.01 at 90% CL, then substantial 
improvements in beams and detectors will be required. 

Equation 5.34 also depends on two other as-yet-poorly understood pa- 
rameters, 23 and the magnitude of Am33. Disappearance experiments 
measure sin? 2023 = 1.00+9-16 67 thus there is an ambiguity as to whether 
623, is larger or smaller than 45°. Am; is only known to about 10%.56:67 
This measurement is extracted from the location of the “dip” in the 
rate versus L/E distribution of disappearance experiments, and is already 
systematics-dominated. Improvement requires experiments with better en- 
ergy resolution? and better understanding of the CCQE and background 
cross sections.!? These errors lend a significant error to the analysis. 

Lastly, it is difficult to measure a ve signal which is at the ~ 1% level. 
Most v, beams have a substantial ve intrinsic contamination from u and 
K decays. Given that 013 is small, this contamination is a serious issue. 
One solution to this problem is to go to an off-axis beam design. This relies 
on the tight correlation between energy and off-axis angle, 0, in two-body 
decays. For pion decay, which dominates most beams, 

_ 0.43Er 
vy — 1+ 762’ 

where y = E,,/m, is the Lorentz boost factor. The solution for two-body K 
decay replaces 0.43 with 0.96. Thus at 0 = 0, the relationship between EF, 
and E, is linear. However, for larger 0, above a moderate energy threshold, 
all values of E, map to the same E,. This is illustrated in Fig. 5.18. The 
result is that an off axis v, beam which comes largely from pion decay is 
tightly peaked in energy while the ve intrinsic background is spread across 
a range of energies. This also helps to reduce vy, events which are mis- 
reconstructed as Ve scatters, such as NC 7° production where a photon is 
lost. These “mis-ids” tend to be spread across a range of energies, since 
the true energy is misreconstructed. Thus a peaked signal, as one expects 
from an off-axis beam, is helpful in separating signal and background. 

The major problem with this design is that off axis beams have sub- 


(5.35) 


stantially lower flux. The flux scales!?° as: 
2y i 2 


In this equation, A is the area of the detector and z is the distance to the 
detector. Two future long baseline experiments, NOvA4 and T2K"! are 
proposing off-axis beams for a v, — Ve search. Because of the low flux, 
very large detectors are required. 
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Fig. 5.18. Neutrino energy versus angle off-axis for various values of pion energy. In 
this example one can see that for moderate off-axis angles, between 15-30 mrad, all pion 
energies between 3 and 20 GeV map to approximately 1 GeV neutrino energy. 


In summary, the path to a test for non-zero 6 is clear but will take 
several steps and requires some luck. First, a clean measure 413 from Double 
Chooz and Daya Bay is needed. If sin? 20,3 < 0.005 at 90% CL, then a 
significant measurement of CP violation is unlikely to be possible in the 
near future. At the same time, improvements in the 623 and Am3, from 
disappearance (v, — V„) measurements at MINOS and T2K will improve 
the situation. T2K and NOvA*? may be able to make a first exploration 
of CP parameter space, from ve appearance measurements, depending on 
statistics. NOvA may also be able to address the mass hierarchy question. 
This will open up the possibility of measuring CP violation to the next 
generation of very long baseline experiments.'42 The most sensitive of 
these use a beam originating at Fermilab and a LAr detector located at 
Ash River, Minnesota*® or a Cerenkov or LAr detector located at the Deep 
Underground Science Laboratory at Homestake.!43 

At some point in the future, a beta beam or a neutrino factory beam 
could provide an intense source of ve and Pe fluxes, allowing comparison 
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of Ve —> Vu to Ve — Du. In this case one would search for events with 
wrong-sign muons in a calorimeter-style detector. This would be a striking 
signature with low background, especially in the case of a beta beam. This 


could allow a very precise measurement of 8.144 


5.3.2. Neutrinos and the Unexpected 


While it is nice to have a tidy, well-motivated theory of neutrino masses, it 
is disconcerting to have essentially no experimental evidence for this theory. 
Moreover, neutrino theories have a history of being incorrect. Only a decade 
ago, most theorists would have told you that neutrinos have no mass. Those 
who thought neutrinos might have mass believed it would be relatively large 
(> 5 eV), explaining dark matter. Most theorists also believed that if the 
solar neutrinos were experiencing oscillations, the correct solution would be 
the small mixing angle MSW solution, because the mixing matrix should 
look like the quark matrix. Using the same logic, the atmospheric neutrino 
deficit, which could only correspond to large mixing angle, was routinely 
dismissed as an experimental effect. 

On the basis of this, it is wise not to constrain ourselves to the “New 
Paradigm.” The reason the APS neutrino study chose to devote a chapter 
to “the Unexpected” was to emphasize the importance of being open to 
what nature is telling us about neutrinos. There are two ways to approach 
this idea: (1) the theory-driven approach: explore for properties which 
could, theoretically, exist and (2) the experiment-driven approach: follow 
up on anomalous results which have been observed. 

For lack of time, I will only briefly consider two examples of the first case: 
searching for a neutrino magnetic moment and searching for CPT violation. 
In the Standard Model, the neutrino magnetic moment is expected to be ~ 
1071 ug. Laboratory experiments and astrophysical limits are many orders 
of magnitude away from this level.‘4° Nevertheless, if a new experiment 
could advance this measurement by an order of magnitude, that would be 
worth pursuing. A more startling discovery would be a difference in the 
oscillation disappearance probability of neutrinos versus antineutrinos. In a 
three-neutrino model, a difference in the rate of disappearance of neutrinos 
and antineutrinos would imply CPT violation. MINOS will be the next 
experiment to pursue such a search.’ If CPT violation were discovered 
we would need to rethink the very basis of our theory. However, there are 
theorists exploring these ideas. 

The remainder of this section will focus on the second approach, explor- 
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ing “anomalies” which have appeared in various experiments. Physicists 
today are always cautious about pursuing deviations from the Standard 
Model. Most do not, in the end, point to new physics. The Standard 
Model has been very resilient Most anomalies are arguably more likely due 
to systematic effects or statistical fluctuations, than to new physics. How- 
ever, those which do “pan out” completely change the way we think. The 
solar neutrino deficit is a perfect example. So, if a new, unexpected result 
withstands questions by the community on the systematics of the experi- 
ment, then the anomaly becomes worth pursuing further. 

There are several examples of > 30 unexpected results in the neutrino 
sector which are worth pursuing and two cases are covered here. The first, 
the LSND anomaly, is being actively pursued. The second, the NuTeV 
anomaly, will require a new experiment. Unlike most of the topics in these 
lectures, the NuTeV anomaly is not directly related to neutrino oscillations 
and neutrino mass, and so expands the discussion, which has so far been 
rather narrowly focussed. 

Along with the known discrepancies which have reached the level of 
full-fledged anomalies, there are also examples of “unexpected results to 
watch.” These results which have not yet reached the 30 level, but are 
showing interesting trends. For example, unconstrained fits to atmospheric 
oscillation data from a wide range of experiments consistently result in 
sin? 2693 best fit value greater than unity. While in each case, the best fit is 
~ lo from unity, it is the trend which is interesting, since the experiments 
involved are all very different. There is simply not enough space to cover 
this and other examples of “results to watch.” 

The take-away message of this section is: the neutrino sector is a rich 
place for new physics to appear, and physicists need to be alert and open- 
minded to what nature is saying. 


5.3.2.1. The LSND Anomaly 


The LSND experiment ran at the LAMPF accelerator at Los Alamos Na- 
tional Laboratory between 1993 and 1998. The decay-at-rest (DAR) beam 
was produced by impinging 800 MeV protons on a beam dump. These 
produced z+s which stop and decay to produce pts, which also stop and 
decay to produce y, and ve. These were studied in the range of 20 to 55 
MeV. The z~s capture, so the beam has a < 8 x 1074 contamination of 
De. The neutrino events were observed in a detector located 30 m down- 
stream of the beam dump. 1220 phototubes surrounded the periphery of a 
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Fig. 5.19. Left: LSND allowed range compared to short baseline experiment limits. 
Right: Allowed range from the Karmen-LSND joint analysis. 149 


cylindrical detector filled with 167 tons of mineral oil, lightly doped with 
scintillator. The signature of a De appearance was De + p — etn. This 
resulted in a two-component signature: the initial Cerenkov and scintilla- 
tion light associated with the e”, followed later by the scintillation light 
from the n capture on hydrogen, producing a 2.2 MeV y. The experiment 
observed 87.9 + 22.4 + 6.0 events, a 40 excess!*° above expectation. 
LSND is a short baseline experiment, with an L/E ~ 1 m/MeV. Thus, 
from the two-generation oscillation formula, Eq. 5.10, one can see that this 
experiment is sensitive to Am? > 0.1 eV?. Other experiments have searched 
for oscillations at high Am?, and the two most relevant to LSND are Kar- 
men!4” and Bugey.!4° The KARMEN experiment, which also used a DAR 
muon beam and was located 17.7 m from the beam dump, had sensitivity 
to address only a portion of the LSND region, and did not see a signal 
there. Since the design of the experiments are very similar, one can think 
of the Karmen experiment as a “near detector,” which measures the flux 
before oscillation. The results were combined in a joint analysis performed 
by collaborators from both experiments; the allowed range for oscillations 
is shown in Figure 5.19.49 The Bugey experiment was a reactor-based De 
disappearance search which set a limit on oscillations. Because this is dis- 
appearance and not explicitly Pe — Dp, its limit is applicable to LSND in 
many, but not all, oscillation models. This limit is shown in Figure 5.19. 


Why can’t we fit LSND into the three-neutrino theory? The 
LSND signal cannot be accommodated within the standard three-neutrino 
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picture, given the solar and atmospheric oscillations. To see the incompat- 
ibility, first consider the case where the oscillation signals de-couple into, 
effectively, two-generation oscillations (Eq. 5.10). For three generations, 
then Am; = Am}, + Am3,, which is clearly not the the case for these 
three signals. The more general case allows the atmospheric result to be 
due to a mixture of high (LSND-range) and low (solar-range) Am? values. 
In order for this model to succeed, the atmospheric Am? from a shape 
analysis must shift up from its present value of ~ 2 x 107? eV? and the 
chlorine experiment must have overestimated the deficit of Be solar neu- 
trinos. However, the largest clash between data and this model arises from 
the Super-K ve events. This model requires that Super-K has missed a ve 
appearance signal of approximately the same size and shape as the v, deficit 
before detector smearing and cuts. Neutrino measurements are experimen- 
tally difficult and parameters do sometimes shift with time as systematics 
are better understood, but it seems unlikely that all of the above results 
could change sufficiently to accommodate LSND. 


Sterile Neutrinos as a Solution Additional neutrinos which do not 
interact via exchange of W or Z are called “sterile;” they may mix with 
active neutrinos, and thereby can be produced in neutrino oscillations. Ex- 
perimental evidence of this would be the disappearance of the active flavor 
from the beam. In contrast to the GUT-scale sterile neutrinos we have 
already discussed, the sterile neutrinos which could explain LSND must be 
light (in the eV range), and this narrows the class of acceptable theories. 
Nevertheless, a number of possible explanations remain.!°° 

Sterile neutrinos solve the LSND problem by adding extra mass split- 
tings. The additional mass states must be mostly sterile, with only a small 
admixture of the active flavors in order to accommodate the limits on ster- 
ile neutrinos from the atmospheric and solar experiments. In principle, one 
might expect three sterile neutrinos. In practice, the data cannot constrain 
information on more than two sterile neutrinos. Therefore these are called 
“3+2” models. The method for fitting the data is described in reference. 151 
One is fitting for two additional mass splittings, Am?, and Am?;. In the 
fit, the three mostly active neutrinos are approximated as degenerate. The 
mixing matrix is also expanded by two rows and two columns. 

The data that drive the fits are the “short baseline” experiments that 
provide information on high Am? oscillations, summarized in Table 5.5. 
The combination of P, —> De (LSND,“° Karmen I1), vp — ve (NO- 
MAD?»?), v, disappearance (CDHS,? CCFR84!%*), and ve disappearance 


228 J. M. Conrad 


Table 5.5. Results used in 3 + 2 fit. sin? 20 limit is 90% CL 


Channel Experiment Lowest sin“ 20 Best reach in 
Am? at high Am? sin? 20 
Vu — Ve LSND 0.03 eV? >25x 1073 | >12x 10-3 
KARMEN 0.06 eV? | <1.7x 1073 | <1.0x 1073 
NOMAD 0.4 eV? <14x 1073 | <1.0~x 107% 
Ve disappearance Bugey 0.01 eV? <14x 107! | <1.3 x107 
Chooz 0.0001 eV? | < 1.0 x 107} | < 5x107? 
Vu disappearance CCFR84 6 eV? NA <2x 1071 
CDHS 0.3 eV? NA <5.3x 1071 


(Bugey,/48 CHOOZ?°) must all be accommodated within the model. A con- 
straint for Super K vy, disappearance is also included. None of the short 
baseline experiments except for LSND provide evidence for oscillations be- 
yond 30. However, it should be noted that CDHS has a 2ø (statistical 
and systematic, combined) effect consistent with a high Am? sterile neu- 
trino when the data are fit for a shape dependence, and Bugey has a lo 
pull at Am? ~ 1eV?. As a result, these two experiments define the best 
fit combination of high and low Am? for the 3+2 model. However, there 
are acceptable solutions with a combination of low Am? values. The best 
fit,°! has Am?, = 0.92 eV?, Am?;=22 eV?, although there are combi- 
nations which work with two relatively low Am? values. A wide range of 
mixing angles can be accommodated, and the best fit has Ueq = 0.121, 
U„4 = 0.204, Ues = 0.036 and U„4 = 0.224. The other mixing angles 
involving the sterile states are not probed by the v,, disappearance, Ve dis- 
appearance and V, — Ve appearance experiments listed above. There is 
30% compatibility for all other experiments and LSND. 

Introducing extra neutrinos, including sterile ones, would have cosmo- 
logical implications, compounded if the extra neutrinos have significant 
mass (>1 eV). However, there are several ways around the problem. The 
first is to note that while the best fit requires a high mass sterile neutrino, 
there are low-mass fits which work within the 3+2 model. The second is to 
observe that there are a variety of classes of theories where the neutrinos 
do not thermalize in the early universe.!?? In this case, there is no conflict 
with the cosmological data, since the cosmological neutrino abundance is 
substantially reduced. 

If more than one Am? contributes to an oscillation appearance signal, 
then the data can be sensitive to a CP-violating phase in the mixing matrix. 
Experimentally, for this to occur, the Am? values must be within less than 
about two orders of magnitude of one another. 
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In 3+2 CP-violating models:1°° 


(-) (-) ; 
P(Yyi Ve) = A|Uea|?|U yal? sin? £41 


+ 4|Ue5|?|Uys|? sin? £51 


+ 8|Ue4||U p4] |Ues||U p5 | 


sin £41 Sin 251 Cos(54 F $54), (5.37) 


where in the last line, the negative sign is for neutrino oscillations and the 
positive sign is for antineutrino oscillations, and defined: 


Lyi = 1.27Am4;L/E, Q54 = arg(U ¿4U p4Ue5U is). 


Thus the oscillation probability is affected by CP violation through the 
term @54. The CP conserving cases are ¢54 = 0 and 180 degrees. 


MiniBooNE First Results The main purpose of the MiniBooNE ex- 
periment was to resolve the question of the LSND signal. First results of 
this experiment, presented in April, 2007, considered those explanations 
with a high expectation for v, — ve oscillations. This includes the CP con- 
serving 3+2 model and many cases of CP violating 3+2 models described 
above. As will be described below, the first results are incompatible with 
Vy, — Ve oscillations, but show an unexpected low energy excess, very much 
in keeping with the subsection title of “Neutrinos and the Unexpected.” 

The MiniBooNE experiment uses the Fermilab Booster Neutrino Beam, 
which is produced from 8 GeV protons incident on a beryllium target lo- 
cated within a magnetic focusing horn. The current of the horn can be 
reversed such that the beam is dominantly neutrinos or antineutrinos. The 
first results are from neutrino running. The MiniBooNE detector is located 
L = 541 m from the primary target, and the neutrino flux has average 
energy of ~ 0.75 GeV. The detector is located 541 m from the front of the 
beryllium target and consists of a spherical tank of radius 610 cm that is 
covered on the inside by 1520 8-inch photomultiplier tubes and filled with 
800 tons of pure mineral oil (CH2). Neutrino events in the detector produce 
both Cerenkov and scintillation light. 

In order to test the LSND result, the MiniBooNE design maintains 
L/E ~ 1 m/MeV while substantially changing the systematic errors as- 
sociated with the experiment. This is accomplished by increasing both L 
and E by an order of magnitude from the LSND design. This changes 
the source of the neutrinos (v, from energetic pions rather than D, from 
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stopped muons), the signature for the signal, and the major backgrounds in 
the detector. In its first run, in neutrino mode, MiniBooNE collected over 
a million clean, neutrino events. About 99.5% of the MiniBooNE neutrino 
events are estimated to be v,,-induced, while 0.5% are estimated to be due 
to “intrinsic” Ve background in the beam. 

The initial MiniBooNE results were analyzed within an appearance-only, 
two neutrino oscillation context. While the LSND signal must be a result of 
a more complex oscillation model, in most cases a Vv, — Ve-like oscillation 
signal is predicted. After the complete Ve event selection is applied, the 
total background was estimated to be 358 +35 events, while 163 +21 signal 
events were expected for the LSND central expectation of 0.26% v, —> ve 
transmutation. 

The top plot of Fig. 5.20 shows candidate ve events as a function of 
reconstructed neutrino energy (EQF). The vertical dashed line indicates 
the minimum £2” used in the two-neutrino oscillation analysis. There is no 
significant excess of events (22+19+35 events) for 475 < EQF < 1250 MeV; 
however, an excess of events (96417420 events) is observed below 475 MeV. 
In the top plot, the points show the statistical error, while the histogram 
is the expected background with systematic errors from all sources. The 
background subtracted excess as a function of E?” is shown in the bottom 
plot, where the points represent the data with total errors. Oscillation 
scenarios are indicated by the histograms. 

The low-energy excess cannot be explained by a two-neutrino oscillation 


model, and its source is under investigation. The low energy events isolated 
by the cuts, including the excess events, are single-ring and electromagnetic- 
like, with no unusual detection issues. The low energy excess events are nei- 
ther consistent with the spatial nor energy distributions of photons coming 
from interactions outside of the tank. Nor are they consistent with the en- 
ergy distribution from single photons from radiative A decays (A —> N+7). 
Mis-identification of n° events is well constrained in MiniBooNE by the rate 
of reconstructed 7° events, studied as a function of m momentum.!°° This 
rate would need to be mis-measured by well over a factor of three to explain 
the excess, far outside of the error of the analysis. 

With that said, as shown in Fig. 5.20, the excess is not in agreement 
with a simple two-neutrino V, — ve oscillation signal. This figure shows 
the predicted spectrum when the best-fit two-neutrino oscillation signal is 
added to the predicted background. The bottom panel of the figure shows 
background-subtracted data with the best-fit two-neutrino oscillation and 
two oscillation points from the favored LSND region. 
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Fig. 5.20. The top plot shows the number of candidate ve events as a function of BOF, 
Also shown are the best-fit oscillation spectrum (dashed histogram) and the background 
contributions from v, and ve events. The bottom plot shows the number of events 
with the predicted background subtracted as a function of Bee. The two histograms 
correspond to LSND solutions at high and low Am?. 


A single-sided raster scan to a two neutrino appearance-only oscillation 
model is used in the energy range 475 < E2” < 3000 MeV to find the 90% 
CL limit corresponding to Ay? = Xiimit — Xbestfit = 1-64. As shown by 
the top plot in Fig. 5.21, the LSND 90% CL allowed region is excluded 
at the 90% CL. A joint analysis as a function of Am?, using a combined 
xX? of the best fit values and errors for LSND and MiniBooNE, excludes 
at 98% CL two-neutrino appearance-only oscillations as an explanation of 
the LSND anomaly. The bottom plot of Fig. 5.21 shows limits from the 
KARMEN!" and Bugey!® experiments. This is plot represents an exam- 
ple of the problem of apples-to-apples comparisons raised in Sec. . The 
MiniBooNE and Bugey curves are 1-sided upper limits on sin? 20 corre- 
sponding to Ay? = 1.64 — hence directly comparable — while the published 
KARMEN curve is a “Feldman Cousins” contour. 
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Fig. 5.21. The top plot shows the v, — ve MiniBooNE 90% CL limit (thick solid curve) 
and sensitivity (dashed curve) for events with 475 < EQF < 3000 MeV. Also shown is 
the limit from a second cross-check analysis (thin solid curve). The bottom plot shows 
the limits from the KARMEN" and Bugey!® experiments. The shaded areas show 
the 90% and 99% CL allowed regions from the LSND experiment. 


Next Steps From the initial MiniBooNE result, one can draw two con- 
clusions: (1) there is excellent agreement between data and prediction in 
the analysis region originally defined for the two-neutrino oscillation search 
and (2) there is a presently unexplained discrepancy with data lying above 
background at low energy. This combination of information severely limits 
models seeking to explain the LSND anomaly. 

Interpreting the MiniBooNE data as appearance-only and combining 
this result with other data in a 3+2 fit does not give a satisfactory re- 
sult.157 While MiniBooNE and LSND are compatible if CP violation is 
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allowed in the 3+2 model, there is tension between these results and the 
v, disappearance experiments. This might be addressed if a 3+2 interpre- 
tation of the MiniBooNE result were expanded to include the possibility 
of v,, disappearance and intrinsic Ve disappearance. This analysis is un- 
derway by the MiniBooNE collaboration.!8 Most likely, if a good fit is 
obtained in a 3+2 scenario, it will require some level of CP violation. Mini- 
BooNE is presently collecting data in antineutrino mode. However, this is 
a small data set (~ 2 x 107° protons on target producing the beam) and 
future running to reach roughly three times the statistics will be required to 
make a decisive statement. Other, alternative explanations are also being 
explored!5? 16! , 

An upcoming result which will shed light on the question is the analysis 
of the MiniBooNE data from the NuMI beam. This beam is 110 mrad off- 
axis, with a m peak of average v, energy of about 200 MeV and a K peak 
of about 2 GeV, and a length of 750 m. If an excess of events is observed 
in this analysis, this rules out mis-estimate of intrinsic Ve in the Booster 
Neutrino Beam as the source of the MiniBooNE excess. Results from this 
study are expected in autumn, 2007. 

If the unexplained excess persists after the above studies, then it will be 
valuable to introduce a detector which can differentiate between electrons 
and photons. That is the goal of MicroBooNE,“ which uses a Liquid argon 
TPC (LArTPC) detector. This is particularly sensitive at low energies 
and nearly background-free. Specifically, this detector has a ve efficiency 
> 80% and rejects photons efficiently through dE/dx deposition in the first 
~2 cm of the shower. With these qualities the detector can be an order 
of magnitude smaller in size than MiniBooNE, making quick construction 
feasible. A proposal for this experiment will be submitted to the Fermilab 
PAC in autumn 2007. 


5.3.2.2. The NuTeV Anomaly 


Neutrino scattering measurements offer a unique tool to probe the elec- 
troweak interactions of the Standard Model (SM). The NuTeV anomaly is 
a 30 deviation of sin? @w from the Standard Model prediction. 16? 
parameterizes the mixing between the weak interaction Z boson and the 
photon in electroweak theory. Deviations of measurements of this param- 


sin? Ow 


eter, and its partner parameter, p, the relative coupling strength of the 
neutral-to-charged-current interactions, may indicate Beyond-Standard- 
Model physics. This section also highlights that fact that new neutrino 
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Table 5.6. left and right handed coupling constants. 


properties may be revealed in TeV-scale interactions at LHC, which has 
not been addressed previously. 

The NuTeV experiment represents a departure from the previous train 
of thought in several ways. NuTeV was a deep inelastic neutrino scattering 
experiment, and thus is performed at significantly higher energy than the 
experiments previously discussed. Also, while NuTeV did an oscillation 
search, it was mainly designed for another purpose: precision measurement 
of electroweak parameters. We will focus on that purpose here. As a result, 
this analysis allows new issues related to neutrino physics to be brought into 
the discussion. 


sin? Ow in Neutrino Scattering and Other Experiments 

In neutrino scattering, the neutral current cross section depends upon 
sin? Ow. The dependence is a function of the neutrino flavor and the target. 
NuTeV was a muon-neutrino-flavor scattering experiment. In this case, the 
NC cross sections for scattering from a light fermion target are: 


-2 
do(vuf > vaf) _ Gis (2 +71 — y)?) 4 2) , (5.38) 


dy Mz 
do(a f > yf) _ GFS 159 2,2 sy B 


In this equation, f is the type of light fermion: f = e~,u,d,s,c. Several 
constants appear: Gp is the Fermi constant, Mz is the mass of the Z. 
The two kinematic variables are: s, the effective center of mass energy, 
which depends on the mass of f and y, the inelasticity (see definitions in 
Sec. 5.1.2). €f, ry are left and right handed coupling constants which are 
given in Table 5.6. 

While neutrino scattering has traditionally been a method for measur- 
ing sin? Ow, the “Standard Model Prediction” quoted in literature comes 
from the very precise measurements made by the LEP and SLD experi- 
ments, which have been summarized by the Electroweak Working Group.!® 
sin? ðw appears in various measurements from e+e~ scattering at the 
Z pole. An example which leads to a highly precise measurement is 
the “left-right asymmetry” measured from polarized scattering at SLD: 
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Apr = (or —or)/(oL + oR) where oy and op refer to the scattering cross 
sections for left- and right- polarized electrons, respectively. In this case 
the asymmetry is given by: 


2 
(4 — sin? ae") — siní g(ef) 


Atr(Z°) = ~_____~ ,—___, 
(4 — sin? Oy (2))” + sint a 


(5.40) 

When comparing quoted values of the weak mixing angle sin? Oy, care 
must be taken because this parameter is defined in various ways. The 
simplest definition is the “on shell” description: 


1 — M2, /M2 = sin? 067-50 (5.41) 


This is the definition commonly used in neutrino physics. In the discussion 
which follows, if not explicitly labeled, the on-shell definition for sin? Oy is 
used. A variation on this definition uses the renormalized masses at some 
arbitrary scale u which is usually taken to be Mz: 


1 — My(u)?/Mz(u)? = sin? 0M), (5.42) 
However, the LEP experiments used the “effective” weak mixing angle 
which is related to the vector and axial vector couplings: 


1 : e 
rio — gł / 94) = sin? ger) (5.43) 


This is what appears in Eq. 5.40 above. One must convert between defi- 
nitions, which have different radiative corrections and renormalization pre- 
scriptions, in order to make comparisons. 

The parameter sin? Oy evolves with Q?, the squared 4-momentum trans- 
fer of the interaction. Fig. 5.22 illustrates this evolution. The highest Q? 
measurements are from LEP and SLD, with Q? = M2. There are sev- 
eral types of experiments, including neutrino experiments, which measure 
sin? Ow with Q? < m3. NuTeV was performed at Q? = 1 to 140 GeV?, 
(Q2) = 26 GeV’, (Q2) = 15 GeV’. The lowest Q? measurements are from 
studies of atomic parity violation in the nucleus'®* (APV), which arises 
due to the electroweak interference of the photon and the Z in the boson 
exchange between the electrons and the nucleus. This samples Q? ~ 0. 
At higher Q?, there is the result from SLAC E158, a Moller scattering ex- 
periment at average Q? = 0.026 GeV?.!® Using the measurements at the 
Z-pole with Q? = M2 to fix the value of sin? Oy, and evolving to low Q?, 
Fig. 5.22166 shows that APV and SLAC E158 are in agreement with the 
Standard Model. 
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Fig. 5.22. Measurements of sin? @w as a function of Q.166 The curve shows the Stan- 
dard Model expectation. 


NuTeV is strikingly off the prediction of Fig. 5.22. Neutrino scatter- 
ing may measure a different result because new physics enters the neutrino 
process differently than the other experiments. Compared to the collid- 
ers, neutrino physics measures different combinations of couplings. Also 
neutrino scattering explores new physics through moderate space-like mo- 
mentum transfer, as opposed to the time-like scattering at the colliders. 
With respect to the lower energy experiments, the radiative corrections 
to neutrino interactions allow sensitivity to high-mass particles which are 
complementary to the APV and Moaller-scattering corrections. 


The NuTeV Result The NuTeV experiment provides the most precise 
measurement of sin? 0w from neutrino experiments. The measurement re- 
lied upon deep inelastic scatter (DIS). It was performed using a “Paschos- 
Wolfenstein style” 167 
atic errors which come from our understanding of parton distributions and 


analysis which is designed to minimize the system- 


masses. 
This method requires separated v and P beams. In this case, the fol- 
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lowing ratios could be formed: 


RY = CNC (5.44) 
CEG 

R? = CNG (5.45) 
ICC 

(5.46) 


Paschos and Wolfenstein!® recast these as: 
eee (5.47) 


where r = oGc/oéc. In the case of R~, many systematics cancel to 
first order. In particular, the quark and antiquark seas for u,d,s, and 
c, which are less precisely known than the valence quark distributions, will 
cancel. Charm production only enters through dyatence which is Cabbibo 
suppressed and at high x, thus the error from the charm mass is greatly 
reduced. One can also form Rt, but this will have much larger systematic 
errors, and so the strength of the NuTeV analysis lies in the measurement 
of R. 

According to the “Paschos-Wolfenstein” method, an experiment should 
run in neutrino and antineutrino mode, categorize the events as CC or 
NC DIS, and then form R~ to extract sin? 0w. This requires identifying 
the CC or NC events properly in NuTeV’s iron-scintillator /drift-chamber 
calorimeter. Most CC DIS events have an exiting muon, which causes a 
long string of hits in the scintillator and are therefore called “long.” Most 
NC DIS events are relatively “short” hadronic showers. However, there 
are exceptions to these rules. A CC event caused by interaction of an 
intrinsic ve in the beam will appear short. An NC shower which contains a 
pion-decay-in-flight, producing a muon, may appear long. The connection 
between long vs. short and CC vs. NC must be made via Monte Carlo. 

NuTeV measurement is in agreement with past neutrino scattering re- 
sults, although these have much larger errors. However, the NuTeV result 
is in disagreement with the global fits to the electroweak data which give a 
Standard Model value of sin? Oy = 0.2227.!62 


Explanations In the case of any anomaly, it is best to start with the 
commonplace explanations. Three explanations for the NuTeV anomaly 
that are “within the Standard Model” have been proposed: the QCD-order 
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of the analysis, isospin violation, and the strange sea asymmetry. The 
NuTeV analysis was not performed at a full NLO level. However, the effect 
of going to NLO on NuTeV can be estimated,!®* and the expected pull 
is away from the Standard Model. The NuTeV analysis assumed isospin 
symmetry, that is, u(x)? = d(x)” and d(x)P = u(x)”. Various models for 
isospin violation have been studied and their pulls range from less than 1o 
away from the Standard Model to ~ lo toward the Standard Model.169 
Variations in the strange sea can either pull the result toward or away from 
the Standard Model expectation,!®? but not by more than one sigma. 

With respect to Beyond-Standard-Model explanations, Chapter 14 of 
the APS Neutrino Study White Paper on Neutrino Theory? is dedicated to 
“The Physics of NuTeV” and provides an excellent summary. The discus- 
sion presented here is drawn from this source. 

The NuTeV measurements of R” and R”, the NC-to-CC cross sections, 
are low compared to expectation. For this to be a Beyond-Standard-Model 
effect, it therefore requires introduction of new physics that suppresses the 
NC rate with respect to the CC rate. Two types of models produce this 
effect and remain consistent with the other electroweak measurements: (1) 
models which affect only the Z couplings, e.g., the introduction of a heavy 
Z’ boson which interferes with the Standard Model Z; or (2) models which 
affect only the neutrino couplings, e.g., the introduction of moderate mass 
neutral heavy leptons which mix with the neutrino. 

Any Z’ model invoked to explain NuTeV must selectively suppress NC 
neutrino scattering, without significantly affecting the other electroweak 
measurements. This rules out most models, which tend to increase the 
NC scattering rate. Examples of successful models are those where the Z’ 
couples to B — 3L,!” or to L, — £7.17 

Moderate-mass neutral heavy leptons, a.k.a. “neutrissimos,” can also 
produce the desired effect. Suppression of the coupling comes from inter- 
generational mixing of heavy states, so that the v, is a mixture: 


Vy, = (COS @)Mignt + (Sin a)Vneavy- (5.48) 


The Zv,,v,, coupling is modified by cos? a and the W uv, coupling is mod- 
ified by cosa. Neutrissimos may have masses as light as ~ 100 GeV.17 
These new particles can play the role of the seesaw right-handed neutrinos, 
as long as one is willing to admit large tuning among the neutrino Yukawa 
couplings.'’? So this offers an alternative to the GUT-mass heavy neutrino 
model discussed in Sec. 5.3.1. 
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If neutrissimos exist, they would be expected to show up in other pre- 
cision experiments. One must avoid the constraints on mixing from 0v pp 
(recall Eq. 5.31 to see why these experiments have sensitivity to the mix- 
ing). These experiment place a limit of |U.4|? at less than a few x10~° 
for a 100 GeV right-handed neutrino. Rare pion and tau decays constrain 
|U,,4|? to be less than 0.004 and |U;4|? to be less than 0.006, respectively. 

Neutrissimos would be produced at LHC, thus neutrino physics can 
be done at the highest energy scales! However, they may be difficult to 
observe. One would naturally look for a signal of missing energy. However, 
neutrissiomos will not necessarily decay invisibly; for example one can have 
N — €+W and the W may decay to either two jets or a neutrino—charged- 
lepton pair. Only the latter case has missing energy. This may make them 
difficult to identify. 

If the neutrissimo is a Majorana particle, then these could provide a 
clue to the mechanism for leptogenesis. The present models of leptogenesis 
require very high mass scales for the neutral lepton. However, theorists 
are identifying ways to modify the model to accommodate lower masses.!”? 
There also may be a wide mass spectrum for these particles, with one 
very heavy case that accommodates standard leptogenesis models, while 
the others have masses in the range observable at LHC.1”4 


NuSOnG and Other Possibilities A new round of precision elec- 
troweak measurements can be motivated by the NuTeV anomaly as well 
as the imminent turn-on of LHC. These measurements are best done us- 
ing neutrino-electron scattering, because this removes the quark-model re- 
lated questions discussed in the previous section. Two possible methods 
for such a measurement are v, scattering with higher statistics, using a 
NuTeV-style beam, or a Pes scattering measurement from a reactor. In 
either case, to provide a competitive measurement, the error from the best 
present neutrino-electron scattering measurement, from CHARM II, must 
be reduced by a factor of five. 

The NuSOnG (Neutrino Scattering On Glass) Experiment!”°is proposed 
to run using a v, beam produced by 800 GeV protons on target from the 
TeVatron. The plan is to use a design which is inspired by the CHARM 
II experiment: a target of SIO2 in one quarter radiation length panels, 
with proportional tubes or scintillator to allow event reconstruction. The 
detector will have a 2.6 kton fiducial volume. The major technical challenge 
of such an experiment is in achieving the required rates from the TeVatron, 
as X20 the rate of NuTeV proton delivery is required. 
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Alternatively, a measurement of the weak mixing angle using anti- 
neutrinos from reactors may be possible.'”° The weak mixing angle can 
be extracted from the purely leptonic Pee “elastic scatter” (ES) rate, which 
is normalized using the Dep “inverse beta decay” (IBD) events, to reduce 
the error on the flux. Thus, a hydrocarbon (scintillator oil) based detector, 
which has free proton targets for the IBD events, is ideal. Gadolinium (Gd) 
doping is necessary for a high rate of neutron capture, which constitutes 
the signal for the IBD events. A window in visible energy of 3 to 5 MeV 
is selected to reduce backgrounds from contaminants in the oil and cosmic- 
muon-induced isotopes. In this energy range, the dominant contamination 
comes from the progeny of the uranium and thorium chain. This would 
clearly be an ambitious, state-of-the-art measurement, but could be done 
at a new reactor experiment where the detector is in close proximity to the 
source and had high shielding from cosmic rays. 


5.3.3. Neutrinos and the Cosmos 


Neutrinos are ubiquitous in the universe, and their presence and inter- 
actions must be incorporated into astrophysical and cosmological models. 
Nearly any new neutrino property will have direct consequences in these 
fields, which must be examined. As an illustrative example, the first discus- 
sion considers the impact of introducing of relatively light sterile neutrinos 
to the theory. Sterile neutrinos with keV-scale masses can explain dark 
matter as well other astrophysical questions. 

As we improve our capability for detecting astrophysical neutrinos, these 
become a new source of neutrinos for study. The second example is a case in 
point: the search for ultra-high energy sources of neutrinos. The discovery 
of such sources would be of great interest to astrophysics, and the particle 
physics we can do with such a “beam” is remarkable. 

While this section concentrates on the unknown, it is interesting to note 
that the known astrophysical sources of neutrinos are sufficiently intense 
that these neutrinos are already a possible background to other physics 
measurements. An example is the case of dark matter searches, which aim 
to measure cross sections as small as 10746 cm?. These experiments will 
have to contend with the background from coherent scattering (v + N —> 
v+ N) of solar neutrinos, which has a cross section of 10~°° cm? and which 
produces a recoil nucleon that is very much like the expected dark matter 
signal.177 
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5.3.3.1. Neutrinos as Dark Matter 


From the mid-1980’s through mid-90’s a 5 eV v, was considered a likely 
candidate for dark matter. This was the motivation for the Chorus and 
NOMAD search for v, —> v, oscillations in the > 10 eV? range!7®:17° as 
well as the proposed COSMOS experiment.!*° 

In the late 90’s and early 2000’s, two measurements led to a shift in 
opinion about neutrinos as candidates for dark matter. The first was the 
Super-K confirmation of v, oscillations. As discussed in Sec. 5.2.1.4, the 
cleanest explanation, which fits within a three-neutrino model, is that this 
effect is vy, — V+. Combining this information with the direct limit on the ve 
implied that neutrinos were unlikely to have masses in the 5 to 10 eV range, 
as required for dark matter. Also, at the same time, studies of the large 
scale structure of the universe indicated that dark matter must be non- 
relativistic, or “cold.” Relativistic, or “hot dark matter,” like neutrinos, 
would smooth the large scale structure far beyond observations. 181 

The idea of neutrinos as dark matter candidates fell into disfavor. For 
some time, the more likely solution was assumed to be WIMPs, Weakly 
Interacting Massive Particles. The ‘ 
fusing, since it does not refer strictly to the weak interaction — other Beyond- 
Standard-Model interactions are involved. It is simply meant to say that 
the interaction rate is very low. 

For some time, the lightest supersymmetric particle has been the most 


favored candidate for the WIMP. However, this is now starting to be ques- 
182 


‘weak” in this name is somewhat con- 


tioned, as no evidence for supersymmetry has been observed at colliders. 
This makes the formulation of the theory more awkward, and SUSY expla- 
nations for dark matter have been pushed from the “Minimal Super Sym- 
metric Model” to the “Next-to-Minimal Super Symmetric Model,” and even 
this is challenged.'®* If supersymmetry does not show up at LHC, then a 
new explanation for dark matter must be found. 

As a result, neutrino models are being reconsidered.!** 1? The new 
models involve neutrinos which are mostly sterile, with a very tiny mixing 
with the light flavors and with keV masses (0.5keV < m, < 15keV and 
sin? 20 > 10712). Because of the high mass, they are not relativistic and are 
regarded as “warm” or “cold.” At high mass, the large scale structure limits 
are much less stringent. A recent analysis from the Sloan Digital Sky Survey 
(SDSS) finds that a sterile neutrino mass above 9 keV can reproduce the 


194 


power spectrum. Only tiny mixing with the active neutrinos is required 


in order to produce the dark matter. With such small mixings, this model 
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sin? 20 


Fig. 5.23. Bounds for ve — vs oscillations from astrophysics and cosmology. Allowed 
regions for neutrino cold, warm and hot dark matter are shown.!87 


easily evades all accelerator-based bounds on ve —> vs oscillations. As shown 
in Fig. 5.23, this model also escapes the cosmological limits on Ve —> Vs 
from the CMB measurements, from Big Bang Nucleosynthesis (BBN), and 
supernova limits (SN), assuming negligible lepton number asymmetry, L, 
in the early universe. 

Currently the only constraints on keV sterile neutrinos come from X- 
193 which are searching for evidence of radiative decay of the 
massive neutrino into a lighter state, v2 —> vı + y. This proceeds through 
loop diagrams where the photon is coupling to a W or a charged lepton in 
the loop.!®° Because this is a 2-body decay, one is searching for a spectral 
line in the x-ray region. For a Dirac-type sterile neutrinos of mass ms the 


ray astronomy 


decay rate is given by: 


* 2 2 2 5 
T (ms) = 1.36 x 10729s7? (= r) ( m ) l (5.49) 


1077 1keV 


which is clearly tiny, even for a keV scale neutrino. This is important 
as the dark matter neutrinos must be stable on the scale of the lifetime 
of the universe. No signal has been observed and the current mass limit, 
from the Chandra X-ray telescope observations, ranges from >3 to >6 keV 
depending on model assumptions. 

Having motivated a keV-mass sterile state using dark matter, one can 
explore the consequences in other areas of cosmology and astrophysics. 
The small mixing allows these neutrinos to evade bounds from big bang 
nucleosynthesis.!°° Their presence may be beneficial to models of supernova 
explosions and pulsar kicks, as discussed below. 
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The existence of these neutrinos may improve the supernova models 
substantially. The problem faced by most models is that the supernova 
stalls and fails to explode. As modest increase in neutrino luminosity during 
the epoch when the stalled bounce shock is being reheated (tpb < 1s) will 
incite the explosion.19’ As the supernova occurs, neutrinos will oscillate 
and even be affected by MSW resonances. If neutrinos oscillate to a sterile 
state, then their transport-mean-free-paths become larger. This increases 
the neutrino luminosity at the neutrino sphere and makes “the difference 
between a dud and an explosion.” 197 

Also, sterile neutrinos in the 1 to 20 keV mass range can also be used 
to explain the origin of pulsar motion. Pulsars are known to have large 
velocities, from 100 to 1600 km/s. This is a much higher velocity than 
an ordinary star which typically has 30 km/s. Pulsars also have very high 
angular velocities. Apparently, there is some mechanism to give pulsars a 
substantial “kick” at birth, which sends them off with high translational 
and rotational velocities. One explanation for the kick is an asymmetric 
neutrino emission of sterile neutrinos during or moments after the explosion 


196 


which forms the pulsar. An asymmetry in neutrino emission occurs 


because the “urca reactions” 


Vetnepte, (5.50) 
Petpontet, (5.51) 


are affected by magnetic fields which trap electrons and positrons. If the 
neutrinos oscillate to a sterile state, they stream out of the pulsar. If the 
sterile neutrinos have high mass, they can provide a significant kick. There 
are a number of solutions, either with standard neutrino oscillations, as 
described by Eq. 5.10 or with an MSW-type resonance. All require a sterile 
state in the 1 to 20 keV range with very small mixing, compatible with the 
dark matter scenario described above. 

In summary, this is an example of how introducing a new neutrino 
property, i.e. sterile companions to the known neutrinos, can have a major 
impact on astrophysical models. This is an interesting case in point, because 
it is unlikely that these neutrinos will be observable in particle physics 
experiments in the near future. At present, the only detection method is 
through X-ray emission due to the radiative decay. Thus this is, at the 
moment, an example of a neutrino property which is entirely motivated 
and explored in the context of astrophysics. 
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5.3.3.2. Ultra High Energy Neutrinos 


All of the experiments so-far discussed have used neutrino sources in the 
energy range of a few MeV to many GeV. We do not know how to pro- 
duce neutrino beams at higher energies. However, nature clearly has high 
energy acceleration mechanisms, because cosmic rays with energies of 108 
GeV have been measured. A new generation of neutrino experiments is 
now looking for neutrinos at these energies and beyond. These include 
AMANDA,*4 ICEcube,!°8 Antares,’ and Anita.!%9 

These experiments make use of the fact that the Earth is opaque to 
ultra-high energy neutrinos. The apparent weakness of the weak inter- 
action, which is due to the suppression by the mass of the W in the 
propagator term, is reduced as the neutrino energy increases. Amaz- 
ingly, when you reach neutrino energies of 10!” eV, the Earth becomes 
opaque to neutrinos. To see this, recalling the kinematic variables de- 
fined in Sec. 5.1.2, consider the following back-of-the-envelope calculation. 
For a 108 GeV v, s = 2ME, = 2 x 108 GeV?. Most interactions oc- 
cur at low x; and at these energies Ttypical ~ 0.001. For neutrino in- 


teractions, the average y is 0.5. Therefore, using Q? = sxy, we find 
typical = (2 x 10°)(1 x 10~*)(0.5) = 1 x 10° Gev?. The propagator term 
goes as 
2 2 
M2, 7 1 _ Mé, (5.52) 
M2,+Q2}  \1+Q2/M2,} Q’ 


which is approximately 107 for our “typical” case. The typical cross sec- 
tion Otypical 1S: 
E x (Stot / E) x (prop term) = 108(GeV)(0.6 x 10~*8cm?/GeV) 10-3 
= 0.6 x 107?3cm?. From this we can extract the interaction length, Ào, on 
iron, by scaling from hadronic interactions, which tells us that at 30 mb 
(= 0.3 x 107? cm?), 1A9 ~ 10 cm. This implies that ào for our very high 
energy vs is ~ 5 x 10° km. However, the Earth is a few x104 km. Thus all 
of the neutrinos interact; the Earth is opaque to them. 

This opens up the opportunity to instrument the Earth and use it as 
a neutrino target. One option is to choose a transparent region of the 
Earth, ice or water, and instrument it like a traditional neutrino detector. 
This has been the design chosen by AMANDA, ICECube and Antares, 
which use phototubes to sense the Cerenkov light produced when charged 
particles from neutrino interactions traverse the material. The largest of 
these detectors are on the order of (1 km)? of instrumented area. The 
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second method exploits the Askaryan effect? 


Electron and positron scattering in matter have different cross sections. 
As the electromangetic shower develops, this difference leads to a negative 
charge asymmetry, inducing strong strong coherent Cerenkov radiation in 
the radio range. The pulse has unique and easy-to-distinguish broadband 
(0.2 - 1.1 GHz) spectral and polarization properties which can be received 
by detecting antennas launched above the target area. In ice, the radio 
attenuation length is 1 km. The Anita Experiment, which uses such a 
detector, can view 2 x 10° km? of volume. This makes it by far the world’s 
largest tonnage experiment. 

Neutrinos with energies above 10+ GeV have yet to be observed. How- 
ever, they are expected to accompany ultra high energy cosmic rays, which 
have been observed. Nearly all potential sources of ultra-high energy cos- 
mic rays are predicted to produce protons, neutrinos, and gamma rays at 
roughly comparable levels. Ultra high energy protons, which have been ob- 
served by the HiRes?! and Auger??? experiments are guaranteed sources 
of neutrinos through the Gresein-Zatsepi-Kuzmin (GZK) interaction. In 
this effect, protons above Egzx = 6 x 10!° GeV scatter from the cosmic 
microwave background: py => At — na*. This degrades the energy of 
protons above Eg@zx, leading to an apparent cutoff in the flux called the 
“GZK cutoff.” There are several A resonances and the CMB photons have 
an energy distribution, so the cutoff is not sharp. But it has been clearly 
observed by both HiRes and Auger.?°% As a result, ultra-high energy pions 
are produced and these must decay to ultra-high-energy neutrinos. There 
may be other, more exotic mechanisms for producing an ultra-high en- 
ergy flux, possibly with energies beyond the GZK cutoff. Since, unlike the 
protons, these neutrinos do not interact with the cosmic microwave back- 


in electromagnetic showers. 


ground, they can traverse long distances and can be messengers of distant 
point sources. 
There many reviews of the exotic physics one can do with ultra high 
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energy neutrino interactions. The opportunities include 
lensing of neutrinos, the search for bumps or steps in the NC/CC ratio, the 
influence of new physics on neutrino oscillations at high energies, the search 
for neutrino decays, neutrino interaction with dark matter WIMPs, and 
the annihilation of the ultra high energy neutrinos by the cosmic neutrino 
background. This final example is interesting because significant limits 
have been set by Anita-lite, a small prototype for Anita that flew only 18.4 
days. This illustrates the power of even a small experiment entering an 
unexplored frontier of particle physics. 
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The 1 eV mass neutrino implied by the LSND anomaly could be a 
candidate for the source of the ultra-high energy cosmic rays observed on 
Earth.?°° This neutrino, if produced at ultra-high energies, could annihilate 
on the cosmic neutrino background producing a “Z-burst” of ultra high 
energy hadrons. This was offered as an explanation for ultra-high energy 
cosmic which were observed by the AGASA experiment.?°° Scaling from 
the AGASA rate, a prediction for the flux of ultra-high neutrinos in the 
energy range of 10185 < E, < 107° eV for Z-burst models was made.?0%298 
Based on this flux, Anita-lite was predicted to see between 5 and 50 events 
at >99% CL. During its short run, this prototype detector observed no 
events in the energy range and therefore could definitively rule out this 
model.?°? Shortly thereafter, the AGASA events were shown to be due to 


energy miscalibration.?!° 


5.4. Conclusions 


The goal of this review was to sketch out the present questions in neutrino 
physics, and discuss the experiments that can address them. Along the way, 
I have highlighted the experimental techniques and challenges. I have also 
tried to briefly touch on technological advances expected in the near future. 
This text followed the structure of the set of lectures entitled “Neutrino 
Experiments,” given at the 2006 TASI Summer School. 

Neutrino physics is an amalgam of astrophysics, cosmology, nuclear 
physics, and particle physics, making the field diverse and exciting, but 
hard to review comprehensively. In this paper, I have tried to touch on ex- 
amples which are particularly instructive and have been forced to leave out 
a wide range of other interesting points. What should be clear, however, is 
that the recent discoveries by neutrino experiments have opened up a wide 
range of interesting questions and opportunities. This promises to be a rich 
field of research for both theorists and experimentalists for years to come. 
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Chapter 6 


String Theory, String Model-Building, and String 
Phenomenology — A Practical Introduction 


Keith R. Dienes 


Department of Physics, University of Arizona, Tucson, AZ 85721, USA 
dienes @physics. arizona.edu 


This is the written version of an introductory self-contained course on 
string model-building and string phenomenology given at the 2006 TASI 
summer school. No prior knowledge of string theory is assumed. The 
goal is to provide a practical, “how-to” manual on string theory, string 
model-building, and string phenomenology with a minimum of mathe- 
matics. These notes cover the construction of bosonic strings, super- 
strings, and heterotic strings prior to compactification. These notes also 
develop the ten-dimensional free-fermionic construction. A final lecture 
discusses general features of heterotic string models, Type I (open) string 
models, and recent trends of string phenomenology. and general features 
of low-energy string phenomenology. 


6.0. Introduction 


These lectures were delivered at the 2006 Theoretical Advanced Study In- 
stitute (TASI), to an audience of graduate students whose interests were 
primarily oriented towards high-energy phenomenology. Indeed, this school 
had a stated focus on neutrino physics, and consequently my goal was to 
present string theory in a way that ultimately might explain how a specific 
particle such as a neutrino might ultimately emerge from string theory. Of 
course, string theory contains a lot more than neutrino physics (and also, 
in some ways, a lot less!), and in the course of these lectures I will not really 
focus so much on neutrinos as on string theory as a whole. Nevertheless, 
I will continue to keep neutrinos as a running theme throughout these lec- 
tures as a way of reminding ourselves that our discussion of string theory 
is ultimately aimed at understanding something real and observable, such 
as an actual neutrino. 
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The title of these lectures indicates that these lectures are meant to 
serve as a practical introduction to string theory, string model-building, and 
string phenomenology. Let me explain, in a rough sense, what each of these 
words is meant to convey. We are all familiar with quantum field theory, 
which is a language through which we might construct particular models 
of physics (such as the Standard Model or the Minimal Supersymmetric 
Standard Model). Such models then have certain physical characteristics, 
certain phenomenologies. String theory, at least as I shall try to present it, 
can likewise be considered as a language for discussing physics: in this sense 
it replaces quantum field theory (a language based on point-particle physics) 
with a new language suitable for theories whose fundamental objects are 
the one-dimensional extended objects known as strings. However, from this 
perspective, string theory is still only a language: it is still necessary to take 
the next step and use this language to construct models that describe the 
everyday world. Therefore, although I will attempt to give a self-contained 
introduction to the language of string theory, these lectures will primarily 
focus on the model-building aspects of string theory and on the resulting 
phenomenologies that these models have. While there already exist many 
excellent reviews of string theory, there are relatively few that focus on 
its model-building and phenomenological aspects. These lecture notes will 
therefore hopefully help to fill the gap, especially for those readers who 
might care less for the formal aspects of string theory and more for their 
phenomenological implications. 

Finally, I should explain the word “practical” which also appears in the 
title. The word “practical” refers to actual practice — the things that prac- 
titioners actually need to know in order to build bona-fide string models 
and/or comprehend their low-energy properties. Of course, string theory 
is a rich and beautiful subject, with many mathematical aspects that are 
compelling and ultimately essential for a deep understanding of the subject. 
However, the goal of these lectures is simply to present the basic features 
of string theory with a minimum of mathematics — as stated in the ab- 
stract, I am seeking to provide a “how-to” manual which cuts the subject 
to the bone and conveys only that information which will be important 
for phenomenology. Therefore, in many places the omissions will be sub- 
stantial. Certainly they do not do justice to the subject. However, these 
lectures were designed for phenomenologically-oriented graduate students 
whose desire (I hope) was to learn something of string theory without be- 
ing deluged by mathematical formalism. It is with them in mind that I 
designed these lectures to be as elementary as feasible, and to “get to the 
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physics” as rapidly as possible. Therefore, I now issue the following 


Warning: These lectures are meant to cover a consid- 
erable amount of introductory material very rapidly and 
without mathematical sophistication. The purpose is to 
advance quickly to the model-building and phenomenolog- 
ical aspects of string theory, while still conveying an in- 
tuitive flavor of the essential issues. The target audience 
consists of people who have had no prior exposure to string 
theory, and who wish to understand the basic concepts 
from a purely phenomenological perspective. 


Hopefully, the students came away with a sense that string theory is a 
real part of physics, one with direct relevance for the real world. Perhaps 
the reader will too. If so, then these lectures will have served their purpose. 


6.1. Lecture #1: Why strings? — an overview 


Why should we be interested in string theory? In this lecture, we shall 
review our present state of knowledge about the underlying constituents 
of matter, and discuss how string theory has the potential to extend that 
knowledge in a profoundly new direction. Since this lecture is meant only 
as an overview, we shall keep the discussion at an extremely superficial 
level and seek to present the intuitive flavor of string theory rather than 
its substance. We shall deal with the substance in subsequent lectures. 


6.1.1. From atoms to the Standard Model: A quick review 


Certainly we do not need to understand string theory in order to appreci- 
ate modern high-energy particle physics, or to understand or interpret the 
results of collider experiments. Why then should one study string theory, a 
subject whose connections to observable phenomena are usually considered 
rather tenuous at best? 

The primary reason, of course, is that the goal of high-energy physics 
has always been to uncover the fundamental “elements” or building-blocks 
of the natural world. These consist of both the fundamental particles that 
make up the matter, and the fundamental forces that describe their inter- 
actions. In this way, we hope to expose the underlying laws of physics in 
their simplest forms. 

But what is “fundamental”? Clearly, the answer depends on the energy 
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scale, or equivalently the inverse length scale, at which these constituents 
are being probed. In order to establish our frame of reference, recall that 
1 eV + 1.6 x 1071? Joules ~ (1077 meters)~!. At the eV scale, the fun- 
damental objects are atoms, or nuclei plus electrons. But it turns out that 
there are many different types of atoms or nuclei — indeed, they fill out 
an entire periodic table, the complexity but regularity of which suggests a 
deeper substructure. And indeed such a deeper substructure exists: at the 
keV to MeV scale, the nuclei are no longer fundamental, but decompose 
into new fundamental objects — protons and neutrons. Thus, at this en- 
ergy scale, the fundamental objects are protons, neutrons, and electrons. 
But once again, it is found that there are many different “types” of protons 
and neutrons — collectively they are called hadrons, and include not only 
the proton (p) and neutron (n), but also the pions (7), kaons (K), rho (p), 
omega (Q), and so forth. Indeed, the “periodic table of the elements” at 
this energy scale is nothing but the Particle Properties Data Book! But 
once again, the complexity and regularity of these “elementary” particles 
suggests a deeper substructure, and indeed such a substructure is found, 
this time at the GeV scale: the proton and neutron are just made of two 
kinds of quarks, the so-called up and down quarks. Thus, at the GeV scale, 
the fundamental objects are up quarks, down quarks, and electrons. But 
once again complexity emerges: it turns out that there are many different 
“types” (flavors) of quarks: up, down, strange, charm, top, and bottom. 
Likewise, there are many different “types” of electrons (collectively called 
leptons): the electron, the muon, the tau, and their associated neutrinos. 
And indeed, once again there is a mysterious pattern, usually referred to 
as a family or generational structure. This once again suggests a deeper 
substructure. 

Unfortunately, this is as far as we’ve come. Indeed, all of our present-day 
knowledge down to this energy scale is gathered together into the so-called 
Standard Model of particle physics. The primary features of the Standard 
Model are as follows. The fundamental particles are the quarks and leptons. 
They are all fermions, and are arranged into three generations of doublets: 


wm (3)-(9-6) 
leptons : 2) ) (") (6.1.1) 


The fundamental forces also come in three varieties. First, there is the 
strong (or “color” ) force, associated with the non-abelian Lie group SU (3). 
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Its fine-structure constant is a3 ~ 1/8 (as measured at energy scales of 
approximately 100 GeV), and it is responsible for binding quarks together to 
form hadrons and nuclei. As such, it is felt only by quarks. Its mediators or 
carriers are called gluons. Second, there is the electroweak force, associated 
with the non-abelian Lie group SU(2). Its fine-structure constant is a2 ~ 
1/30 (indeed, weaker than the strong force!), and it is responsible for 8- 
decay. Unlike the strong force, it is felt by all of the fundamental particles. 
Finally, there is the “hypercharge” force, associated with the abelian Lie 
group U(1), with fine-structure constant a, + 1/59. Once again, this force 
is felt by essentially all particles, both quarks and leptons. The carriers 
of the latter two forces are the photon as well as the W= and Z particles. 
Indeed, ordinary electromagnetism is a combination of the electroweak and 
hypercharge forces, and is the survivor of electroweak symmetry breaking. 
This breaking is induced by the one remaining particle of the Standard 
Model, a boson called the Higgs particle. An excellent introduction to 
the physics of the Standard Model can be found in the TASI lectures of 
G. Altarelli (this volume). 


6.1.2. Beyond the Standard Model: Two popular ideas 


Is that all there is? Clearly, there are lots of reasons to believe in something 
deeper! First, the Standard Model contains many arbitrary parameters, 
such as the masses and “mixings” of fundamental particles. All of these 
must ultimately be fit to data rather than explained. Second, there are 
many conceptual questions. Why are there three generations? Why are 
there three kinds of forces? Why do these forces have different strengths and 
ranges? A fundamental theory should explain these features. Finally, there 
is also another force which we have not yet mentioned: the gravitational 
force. How do we incorporate the gravitational force into this framework? 
In other words, how do we “quantize” gravity? 

There is only one conclusion we can draw from this state of affairs. Just 
as in each previous case, there must still be a deeper underlying principle. 
It is important to stress that this is not simply an issue of academic interest. 
Rather, it is one of practical importance, because the next generation of 
particle accelerators are being built right now! (Two of the most prominent 
that will be exploring physics beyond the Standard Model are Fermilab, 
where upgrades to the TeVatron are being implemented, and CERN, where 
construction of the Large Hadron Collider (LHC) is already underway.) 
The pressing question, therefore, is: What do we expect to see at these 
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machines? What will high-energy physics be focusing on over the next ten 
to twenty years? It turns out that there are two very popular sets of ideas, 
both of which are thoroughly reviewed in the TASI lectures of N. Polonsky. 


6.1.2.1. Low-energy supersymmetry 


The first idea is supersymmetry (SUSY). This refers to a new kind of sym- 
metry in physics, one which relates bosons (particles with integer spin) to 
fermions (particles with half-integer spin). Thus, for every known particle, 
there is a predicted new particle, its so-called superpartner: 


quarks <= > squarks 
leptons <=> sleptons 
gauge bosons <> gauginos . (6.1.2) 


Clearly, this implies the existence of a lot of new particles and a lot of new 
interactions! Why then go through all this trouble? 

Well, it turns out that supersymmetry can provide a number of strik- 
ing benefits. First, through supersymmetry, we can explain the relative 
strengths of the forces (“gauge coupling unification”). Second, we can ex- 
plain the origin of electroweak symmetry breaking. Third, supersymmetry 
has a number of favorable cosmological implications (for example, super- 
symmetry provides a natural set of dark-matter candidates). Finally, it 
turns out that supersymmetry is the only known answer to certain difficult 
theoretical puzzles in the Standard Model (chief among them the so-called 
“gauge hierarchy problem”, i.e., the difficulty of explaining the lightness of 
the Higgs particle, or equivalently to difficulty of explaining the stability of 
the scale of electroweak symmetry breaking against radiative corrections). 
In order to serve as an explanation of the gauge hierarchy problem, the 
energy scale associated with supersymmetry must not be too much higher 
than the scale of electroweak symmetry breaking. This is therefore called 
“low-energy supersymmetry”, which refers to the common expectation that 
superparticles should exist at or near the TeV-scale. 

Supersymmetry is a beautiful theory, both phenomenologically and 
mathematically. But it is not observed in nature. Therefore, supersym- 
metry must be broken. The problem, however, is that supersymmetry 
is very robust! It turns out to be quite hard to find mechanisms that 
can easily (“spontaneously”) break supersymmetry at the expected energy 
scales. Therefore, we are faced with a major unsolved problem: How do 
we break supersymmetry? Indeed, we often have to resort to introducing 
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SUSY-breaking by hand, which requires the introduction of many addi- 
tional unknown parameters. This is quite unpleasant, not only from an 
aesthetic point of view but also a phenomenological (predictive) point of 
view. However, it is often possible to consider only a minimal supersym- 
metric extension to the Standard Model (the so-called MSSM) where a 
minimal number of supersymmetry-breaking parameters are chosen. 


6.1.2.2. Grand unification 


The second popular idea for physics beyond the Standard Model concerns 
so-called Grand Unified Theories (GUTs). This refers to an attempt to 
realize the different forces and particles in nature as different “faces” or 
“aspects” of a single GUT force and a single GUT particle. An electro- 
magnetic analogy here might be useful. Recall that the electric force is felt 
or caused by static charges, and that the magnetic force is felt or caused 
by moving charges. Are these therefore different forces? As we know, the 
answer is most definitely “no”: we can Lorentz-boost from a rest frame to a 
moving frame, whereupon the distinction between the electric and magnetic 
forces melts away and these forces become intertwined. Thus, we conclude 
that the electric and magnetic forces are merely different aspects of one 
force, the “electromagnetic” force. 

Is the same true for the strong, electroweak, and hypercharge forces? Is 
there a single “strong-weak-hypercharge” GUT force? 

At first glance, this doesn’t seem possible, because these different forces 
have different strengths. Recall their fine-structure constants: ay ~% 1/59, 
a2 ~ 1/30, and a3 ~ 1/8. However, also recall that in quantum field theory, 
the strengths of forces ultimately depend on the energy scale through which 
they are measured. To see why this is so, let us think of placing a positive 
charge next to a dielectric. The positive charge draws some negative charge 
from within the dielectric towards it, so that the dielectric medium partially 
screens the positive charge. Therefore, in a rough sense, the less of the 
dielectric we see (i.e., the more finely resolved our experimental apparatus 
to probe the original positive charge), the stronger our original positive 
charge seems to be. Thus, we see that at shorter distances (corresponding 
to higher energies), our electric charges (and therefore the corresponding 
electric forces) appear to be stronger. If this dielectric analogy serves as 
a good model for the results of a true quantum field-theoretic calculation 
(and in this case it does), we conclude that the electric force appears to 
grow stronger with increasing energy. 
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Of course, this is just a mechanical analogy. However, in the super- 
symmetric Standard Model, it turns out that the quantum field-theoretic 
vacuum itself indeed behaves like a dielectric for the hypercharge and weak 
forces. However, for the strong force, it behaves as an anti-dielectric. Thus, 
while the hypercharge and electroweak forces become stronger at higher 
energies, the strong force becomes weaker at higher energies. (This latter 
feature is the celebrated phenomenon of asymptotic freedom.) Together, 
these observations imply that these three forces have a chance of unifying 
at some energy scale if their strengths become equal, and indeed, carrying 
out the appropriate calculations, one finds the results shown in Fig. 6.1. 
We see from this figure that the forces appear to unify at the scale 


Mour ~ 2x 10!® GeV . (6.1.3) 


This would then be the natural energy scale for grand unification. Note that 
this unification also requires the existence of weak-scale supersymmetry in 
the form of weak-scale superpartners. Without such superpartners, the 
evolution of these fine-structure constants as a function of the energy scale 
is different, and they fail to unify at any scale. This then serves as another 
motivation for weak-scale supersymmetry. 

GUTs would have numerous important effects on particle physics. First, 
by their very nature, they would imply new interactions that can mix the 
three fundamental forces. Second, this in turn implies that GUTs naturally 
lead to new, rare decays of particles. The most famous example of this is 
proton decay, the rate for which is experimentally known to be exceedingly 
small (since the proton lifetime is Tp Z 10°? years). Third, GUTs would 
naturally explain the quantum numbers of all of the fundamental particles. 
Along the way, GUTs would also explain charge quantization. GUTs might 
also explain the origins of fermion mass. Finally, because they generally 
lead to baryon-number violation, GUTs even have the potential to explain 
the cosmological baryon/anti-baryon asymmetry. By combining GUTs with 
supersymmetry in the context of SUSY GUTs, it might then be possible 
to realize the attractive features of GUTs simultaneously with those of 
supersymmetry in a single theory. 

Both the SUSY idea and the GUT idea are very compelling. Certainly, 
the SUSY idea (and indirectly the GUT idea, through measurements of 
proton decay and other rare decays) will be the focus of experimental high- 
energy physics over the next 20 years. But high-energy theorists also have 
plenty of work to do — we must build theories in order to interpret the 
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Fig. 6.1. One-loop evolution of the gauge couplings within the Minimal Supersymmetric 
Standard Model (MSSM), assuming supersymmetric thresholds at the Z scale. Here 
a1 = (5/3)ay, where ay is the hypercharge coupling in the conventional normalization. 
The relative width of each line reflects current experimental uncertainties. 


data. But how do we build realistic SUSY theories? How do we build 
realistic GUT theories? How do we incorporate gravity? 

Clearly, the possibilities seem endless. And even the SUSY or GUT 
ideas have not answered our most fundamental questions, such as why 
there are three gauge forces, or why there are three generations. Therefore, 
it is natural to hope that there is yet a deeper principle that can provide 
some theoretical guidance. And that’s where string theory comes in. 


6.1.3. So what is string theory? 


The basic premise of string theory is very simple: all elementary particles 
are really closed vibrating loops of energy called strings. The length scale 
of these loops of energy is on the order of 10~*° meters (corresponding to 
101° GeV), so it is not possible to probe this stringy structure directly. 

This idea has great power, because it provides a way to unify all of 
the particles and forces in nature. Specifically, each different elementary 
particle can be viewed as corresponding to a different vibrational mode 
of the string. A pictorial representation of this idea is given in Fig. 6.2, 
where we are schematically associating higher vibrational string modes with 
string loops containing more “wiggles”. From the point of view of a low- 
energy observer who cannot make out this stringy structure, the different 
excitations each appear to be point particles. However, to such an observer, 
the states with more underlying “wiggles” appear to have higher spin. Thus, 
in this way we find that string theory predicts not only spin-1/2 and spin-1 
states (which can be associated with the fermions and gauge bosons of the 
Standard Model respectively), but also a spin-2 state (which can naturally 
be associated with the graviton). Thus, through string theory, we see that 
the gauge interactions, particles, and also gravity are unified into a common 
quantized description as corresponding to different excitation modes of a 
single fundamental entity, the string itself. 
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Fig. 6.2. The basic hypothesis of string theory is that the different elementary particles 
correspond to the different vibrational modes of a single fundamental entity, a closed loop 
of energy called a string. In this way one obtains not only spin-1/2 and spin-1 states 
which can be associated with the matter and gauge bosons of the Standard Model, but 
also a spin-2 state which can be identified with the graviton. Thus, string theory provides 
a way of unifying the Standard Model with gravity. 


Of course, this is not the end of the story. Just as a violin string has 
an infinite number of harmonics, so too does a string give rise to an infi- 
nite tower of states corresponding to higher and higher vibrational modes. 
Since it takes more and more energy to excite these higher vibrational string 
modes, such states are increasingly massive. Indeed, because the fundamen- 
tal string scale is on the order of Mstring © 101° GeV, these string states 
are quantized in units of Mstring. The states which we have illustrated in 
Fig. 6.2 are all massless with respect to Mstring, and correspond, in some 
sense, to the ground states of the string. These are the so-called “observ- 
able states”, and include not only the (supersymmetric) Standard Model 
and (super)gravity, but also may include various additional states (often 
called “hidden-sector states” which contain their own matter and gauge 
particles). However, there also exists an infinite tower of massive states 
with masses Mn © /nMstring, n E€ Z*. In most discussions of the phe- 
nomenological properties of string theory, these massive states are ignored 
(since they are so heavy), and one concentrates on the phenomenology of 
the massless states. One then presumes that they accrue (relatively small) 
masses through other means, such as through radiative corrections. 

Nevertheless, the passage from point particles to strings has tremendous 
consequences. Not only have we replaced the physics of zero-dimensional 
objects (elementary point particles) with the physics of one-dimensional 
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objects (strings), but we have also replaced the physics of the one- 
dimensional worldlines that they sweep out with the physics of two- 
dimensional so-called worldsheets. Likewise, we have replaced the physics 
of Feynman diagrams with the physics of two-dimensional manifolds, so 
that a tree diagram corresponds to a genus-zero manifold (a sphere) and 
a one-loop diagram corresponds to a genus-one manifold (a torus). These 
comparisons are illustrated in Fig. 6.3. Note that the latter descriptions 
as spheres and tori correspond to shrinking the external strings to points, 
essentially “pinching off” the external legs. This is a valid description for 
reasons to be discussed in Lecture #2. 

This is clearly a new language for doing physics. However, as we have 
seen, because string theory also includes gravity (which is exceedingly weak 
compared with the other forces), its fundamental mass scale is very high. 
Indeed, since the fundamental energy scale for gravity is the Planck mass 


he 
GN 


the string scale must also be very high. Indeed, to a first approximation, it 
turns out that 


MPtanck = ~ 10’ GeV = (10° cm.) , (6.1.4) 


Mstring X string MPianck (6.1.5) 


where Qstring is the string coupling constant, typically assumed to be ~ 
O(1). Thus, we see that string theory is ultimately a theory of Planck- 
scale physics. 

There are lots of “formal” reasons for being excited about string theory. 
First, it turns out that string theory requires the existence of extra space- 
time dimensions in order to be consistent, and consequently we now have 
to consider physics in different numbers of dimensions as well as all sorts 
of geometric questions pertaining to different possible “compactification” 
scenarios. Second, string theory gives us a new perspective on the struc- 
ture of spacetime itself. For example, string theory gives rise to many novel 
Planck-scale effects. One of these is called T-duality: the physics of a closed 
string in a spacetime one of whose dimensions is compactified on a circle of 
radius R turns out to be equivalent to the physics of the same string in a 
spacetime in which the radius is Me ik /R. Thus, T-duality interchanges 
large radii and small radii, and suggests that our naive view of spacetime 
and its linear hierarchy of energy and length scales cannot ultimately be 
correct. Third, string theory also provides new types of strong/weak cou- 
pling dualities. These have proven useful for elucidating the strong-coupling 
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Fig. 6.3. In string theory, we replace (a) zero-dimensional elementary particles with one- 
dimensional strings; (b) one-dimensional worldlines with two-dimensional worldsheets; 
and (c) Feynman diagrams with two-dimensional manifolds. For example, tree dia- 
grams correspond to genus-zero manifolds (spheres), and one-loop diagrams correspond 
to genus-one manifolds (tori). 
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dynamics of not only string theory, but also field theory. Finally, there have 
even been novel applications to black-hole physics. The most famous exam- 
ple of this is the fact that various non-perturbative string structures called 
D-branes have provided the first statistical (i.e., microscopic) derivation of 
the Bekenstein-Hawking entropy formula S = A/4 that relates the entropy 
S of a black hole to its surface area A. Indeed, the above list only begins to 
scratch the surface of all of the many exciting recent formal developments 
in string theory. 

But we are phenomenologists, so it is natural to ask about the rest 
of high-energy physics. How does string theory connect with the rest of 
particle physics? 

Some of the answers to this question have already been given above. We 
have seen, in particular, that string theory is capable of reproducing the 
Standard Model as its low-energy limit. Moreover, as we have also seen, the 
Standard Model naturally emerges coupled with gravity. Furthermore, in 
many cases this entire structure is also joined with supersymmetry. Finally, 
this entire structure is also often joined with many properties of GUTs (such 
as gauge coupling unification). All of this comes out of the low-energy limit 
of string theory, in some sense automatically. 

There are also many other benefits to considering the application of 
string theory to particle physics. First, string theory provides us with new 
kinds of symmetries (so-called “worldsheet symmetries” ) which lead to pow- 
erful new constraints on the resulting low-energy phenomenology. Second, 
in principle* string theory has no free parameters, which leads to a very 
predictive theory. Third, string theory has no divergences — in some sense, 
string theory is a completely finite theory in which many of the trouble- 
some divergences associated with field theory are simply absent. Finally, it 
*In this connection, we hasten to emphasize the phrase “in principle”. Unfortunately, 
our relative inability to understand the non-perturbative structure of string theory often 
means that the pragmatic consequences of having no free parameters cannot be realized, 
and in practice one is often forced to introduce many parameters to reflect our ignorance 
of the underlying dynamics. This will be discussed in subsequent lectures. This situation 
is rather analogous to one that arises in the MSSM: we do not know how supersymmetry 
is broken, so we typically parametrize our ignorance through the introduction of various 
supersymmetry-breaking parameters. Likewise, in string theory, there are analogous 
questions which come under the heading of “vacuum selection”: we do not know how 
the non-perturbative dynamics of string theory selects a particular vacuum state. Thus, 
in order to proceed to make phenomenological predictions, we are often forced to assume 
a certain vacuum state, or to parametrize the vacuum via the introduction of essentially 
unfixed parameters. The important point, however, is that string theory is a complete 


theory in that it should in principle, by virtue of its dynamics, uniquely fix the values 
of all of its fundamental parameters. 
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turns out that string theory can even give rise to a new perspective on the 
Standard Model itself, and often provides new and simpler ways to perform 
calculations. 

These last three points (absence of free parameters, absence of diver- 
gences, and new ways to perform calculations) are truly remarkable. There- 
fore, let us pause to explain in an intuitive way why these features arise. 
First, let us explain why string theory has fewer free parameters. To do this, 
let us consider a Feynman diagram for a typical tree-level decay A — B+C, 
as shown in Fig. 6.4(a). In field theory, such a process depends on many sep- 
arate parameters ultimately associated with the separate propagators and 
vertices. Specifically, even though the propagators are determined once the 
masses and spins of the particles are specified, there still remains an inde- 
pendent choice as to the form of the vertex interaction. Thus, in a given 
field theory, there still remain many independent parameters to choose. In 
string theory, by contrast, there is no sharp distinction between propaga- 
tors and vertices; they melt into each other, and are essentially the same. 
Thus, once the propagators are determined, the vertices are also intrinsi- 
cally determined. This is one of the underlying reasons why string theory 
contains fewer free parameters than field theory. 

Next, let us discuss why string theory is more finite than field theory. 
To do this, let us consider a typical one-loop Feynman diagram, as shown in 
Fig. 6.4(b). In field theory, the virtual interactions occur at sharp spacetime 
locations x and y. This is ultimately the origin of the ultraviolet (7.e., short- 
distance) divergence as x — y. In string theory, by contrast, we have seen 
that there are no such sharp interaction points — essentially the interaction 
is “smoothed out” by the presence of the string. Thus, there is no sense 
in which the dangerous x — y limit exists, for there are no precise means 
by which one can define such interaction locations x and y. It is in this 
manner that string theory automatically removes ultraviolet divergences: 
the string itself, through its extended geometry, acts as a (Planck-scale) 
ultraviolet regulator. 

Finally, let us discuss why string theory can often give us simpler ways 
to perform calculations than in field theory. To do this, let us consider 
the total tree-level amplitude for a typical process A+ B —> C + D, as 
illustrated in Fig. 6.4(c). As we know, in field theory there are two separate 
topologies of Feynman diagram that must be separately considered: the s- 
channel diagram and the ¢t-channel diagram. In general, at any given order, 
there are many separate diagrams to evaluate, and one often finds that 
great simplifications and cancellations occur only when these individual 
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Fig. 6.4. (a) Illustration of the fact that string propagators and string vertices are not 
independent. (b) Illustration of the fact that string theory lacks many of the ultraviolet 
divergences that arise in field theory from the short-distance limit x — y. (c) Illustration 
of the fact that one string diagram often comprises many field-theoretic diagrams. 


contributions are added together. In string theory, by contrast, there is 
only one corresponding diagram to evaluate at any given order. Thus, the 
sorts of simplifications or cancellations that might occur in field theory are 
automatically “built into” string theory from the very beginning. In some 
sense, string theory manages to find a way to reorganize the field-theory 
diagrams in a perturbative expansion in a useful and potentially profitable 
way. Indeed, this observation has even led to the development of many new 
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techniques for evaluating complicated field-theoretic processes, particularly 
in QCD where the number of diagrams and the number of terms in each 
diagram can easily grow to otherwise unmanageable proportions. 

We thus see that in a number of ways, string theory is a very useful lan- 
guage in which we might consider thinking about particle physics. Indeed, 
in various aspects (such as finiteness, fewer parameters, etc.) it is supe- 
rior to field theory. But overall, the fundamental fact remains that if we 
are thinking about strings, we are abandoning our usual four-dimensional 
point of view of particle physics. Specifically, since each different par- 
ticle in spacetime is now interpreted as a different quantum mode exci- 
tation of an underlying string, we see that four-dimensional (spacetime) 
physics is now ultimately the consequence of two-dimensional (worldsheet) 
physics. Thus, everything we ordinarily focus on in field theory (such as 
the four-dimensional particle spectrum, the gauge symmetries, the cou- 
plings, etc.) are now all ultimately determined or constrained by worldsheet 
symmetries. 

And this brings us to string phenomenology. 


6.1.4. So what is string phenomenology? 


In order to understand what string phenomenology is, we can draw a useful 
analogy. Just as we are replacing the language of high-energy physics from 
field theory to string theory, we likewise replace field-theory phenomenol- 
ogy with string-theory phenomenology. The goals of string phenomenology 
are of course the same as those of ordinary field-theory phenomenology: 
both seek to reproduce, explain, and predict observable phenomena, and 
both seek to suggest or constrain new physics at even higher energy scales. 
Indeed, only the language in which we will carry out this procedure has 
changed. Thus, in some sense, string phenomenology is the “art” of using 
the new insights from string theory in order to understand, explain, and 
predict what physics at the next energy scale is going to look like. Or, re- 
calling that string theory is ultimately a theory of Planck-scale physics, we 
can say that string phenomenology is the “interplay” or “meeting-ground” 
between Planck-scale physics and GeV-scale physics. 

It is important to understand that we are not abandoning field theory 
completely. Nor would we want to. Field theory automatically incorporates 
many desirable features such as causality, spin-statistics relations, and CPT 
invariance (which in turn implies the existence of antiparticles). These are 
all generic predictions of field theory, and are the underlying reasons why 
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field theory is the appropriate language for particle physics. However, since 
string theory ultimately reduces to field theory in its low-energy limit, all of 
these features will still be retained in string theory. Moreover, as we have 
seen, string theory additionally predicts or explains gravity, supersymme- 
try, and the absence of ultraviolet divergences. Furthermore, as we shall 
see, string theory also automatically predicts the existence of gauge sym- 
metry, and even incorporates features such as gauge coupling unification. 
These are all generic predictions of string theory. It is for these reasons to 
believe that a change in language from field theory to string theory might 
be useful. 

String theory will also provide us with new tools for model-building, 
new mechanisms and new guiding principles. Let us give some examples. 
In field theory, there are many well-known ideas that are part and parcel 
of the model-building game: one must enforce ABJ anomaly cancellation 
(to preserve gauge symmetries); one can employ the Higgs mechanism (to 
generate spontaneous symmetry breaking and give masses to particles); 
one has the GIM mechanism (to preserve flavor symmetries); and one has 
supersymmetry (to cancel quadratic divergences). Likewise, in string theory 
there are analogous sets of ideas, many of which are extensions of their field- 
theory counterparts. For example, one has the so-called “Green-Schwarz” 
mechanism for anomaly cancellation (to preserve gauge symmetries); one 
has string vacuum shifting via pseudo-anomalous U(1) gauge symmetries 
(to generate spontaneous symmetry breaking and generate particle masses); 
one has spacetime compactification (to generate gauge symmetries); one has 
hidden string sectors (to break supersymmetry and impose selection rules); 
and one has massive towers of string states (to enforce finiteness). Thus, 
model-building proceeds, but with a different set of principles. 

There is also a much more subtle effect of changing our language from 
field theory to string theory. Ultimately, since four-dimensional physics is 
now derived from an underlying two-dimensional (worldsheet) theory, string 
phenomenology is ultimately much more constrained than field-theory phe- 
nomenology. One given worldsheet symmetry, which might serve as an “in- 
put”, can have various seemingly unrelated effects in the resulting spacetime 
phenomenological “output”. Thus, string theory not only leads to unex- 
pected connections or correlations between seemingly disparate spacetime 
phenomena, but can also give rise to entirely new phenomenological sce- 
narios that could not have been anticipated within field theory alone. We 
will see many examples of this in the coming lectures. 
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Thus, we see that string phenomenology does many things and has many 
goals: 


e to provide a new framework for addressing and answering numerous 
phenomenological questions; 

e to provide a rigorous test of string theory as a theory of physics; 

e to explore the interplay between worldsheet physics and space- 
time physics (i.e., to ultimately determine which “patterns” of low- 
energy phenomenology are allowed or consistent with being realized 
as the low-energy limit of an underlying string theory); and 

e to augment field theories of “low-energy” physics into the string 
framework so as to give them the full benefits of the language of 
string theory. 


Because of these different roles, string phenomenology occupies a rather 
central position in high-energy physics: it allows the transmission of ideas 
from high-scale string theory to guide “low”-scale particle physics, and vice 
versa. This situation is illustrated in Fig. 6.5. At the lowest energies (lower 
left), string phenomenology has direct relevance for the Standard Model, 
where it can potentially explain features such as the choice of the gauge 
group, the number of generations, and numerous other parameters such as 
the masses and mixings of Standard-Model particles. At slightly higher 
energies (lower right), we see that string phenomenology can also suggest 
or constrain various extensions to the Standard Model, such as SUSY and 
SUSY-breaking, grand unification, and hidden-sector physics. At the high- 
est energies (upper left), string phenomenology is also concerned with the 
more formal aspects of string theory: such important questions include 
string vacuum selection, non-perturbative string dynamics, string duality, 
and new mathematical structures and techniques. And string phenomenol- 
ogy even has relevance outside the strict confines of particle physics. For 
example, string theory should have a profound impact on cosmology (upper 
right), where important stringy issues include the role of the dilaton, the 
effects of many other light degrees of freedom (the so-called moduli), the 
possibility of extra spacetime dimensions, the cosmological constant prob- 
lem, and even more exotic ideas such as topology change. As illustrated 
in Fig. 6.5, string phenomenology sits at the center of this web of ideas. 
Exploring the connections between the different corners of this figure is, 
therefore, the job of the string phenomenologist. Indeed, through string 
phenomenology, one “uses” string theory in order to open a window into 
the possibilities for physics beyond the Standard Model. 
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String Cosmology: 
Formal Issues: 
e role of dilaton 
e string vacuum selection e effects of light d.o.f.’s 
e non-pert. dynamics e extra dimensions 
e string duality e cosmological constant 
e new mathematical problem 
structures... e Hagedorn phenomenon 
e topology change... 


String Phenomenology 


Extensions to SM: 


Standard Model: 
e SuSY 


; --MSSM 
choice of gauge group --supergravity 
number of generations --SUSY-breakin 
masses © GUTs I 
mixing angles... e hidden sectors 


e technicolor... 


Fig. 6.5. String phenomenology is the central “meeting-ground” between Standard- 
Model physics, extensions to the Standard Model, formal string issues, and string 
cosmology. 


6.1.5. Plan of these lectures 


For much of the past decade, string phenomenology has been practiced 
assuming a particular type of underlying string theory, the so-called per- 
turbative heterotic string. Therefore, this string will be the focal point of 
most of these lectures. However, it turns out that the heterotic string is 
built directly on the foundations of two other kinds of strings, the bosonic 
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string and Type IT superstring. Indeed, in a sense to be made more pre- 
cise in Lecture #5, one can view the heterotic string as the “sum” of the 
bosonic string and the superstring string. Therefore, in these lectures, we 
will have to start at the beginning by studying first the bosonic string, then 
the Type II string, and finally the heterotic string. Indeed, this situation is 
analogous to the way in which one often studies quantum field theory: first 
one learns how to quantize the Klein-Gordon field, then the Dirac field, and 
finally the gauge field. In a certain sense, the bosonic string is the analogue 
of the Klein-Gordon field, while the Type II superstring is the analogue of 
the Dirac field and the heterotic string is the analogue of the gauge field. 
Of course, this analogy is only a pedagogical organizational one, since the 
heterotic string itself will ultimately contain all of the phenomenological 
properties (e.g., scalars, fermions, and gauge symmetries) that we desire. 

In Lecture #2, we will therefore give a brief introduction to the bosonic 
string, stopping only long enough to develop the ideas and techniques we 
will need for later applications. In Lectures #3 and #4, we will then 
proceed to develop the Type II superstring, once again focusing on only 
those aspects that will be useful for later applications. Finally, in Lecture 
#5, we will arrive at our destination: the heterotic string. In Lecture 
#6 we will construct some ten-dimensional heterotic string models, and 
in Lecture #7 we will develop a useful set of rules for heterotic string 
model-building. 

It is important to note, however, that all of string phenomenology is not 
based on the heterotic string. Particularly over the past decade, there has 
been a profound shift in our understanding of both string theory and its 
phenomenological implications. One of the consequences of this so-called 
“second superstring revolution” has been a new emphasis on yet another 
class of strings, the Type I (open) strings. Within this class, so-called 
intersecting D-brane models have shown great promise in yielding chiral, 
Standard-Model-like spectra. Indeed, there has even emerged a new super- 
structure which promises to relate all of these strings to each other: this 
structure is called M-theory, and is deeply tied to many non-perturbative 
aspects of string theory which are still being understood. Needless to say, 
these recent developments have the potential to completely change the way 
we think about string theory and string phenomenology. We will therefore 
discuss some of these modern developments in the final Lecture #8. Nev- 
ertheless, the bulk of these lectures will primarily be focused on the more 
traditional aspects of string phenomenology that concern the weakly cou- 
pled heterotic string. Indeed, this affords the best introduction to string 
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theory and string phenomenology, regardless of the future directions that 
string theory and string phenomenology might ultimately take. 

We also remind the reader that our goal here is to provide an intro- 
duction to string theory that avoids mathematical complications wherever 
possible, and which “gets to the physics” as rapidly as possible. Therefore, 
in many places, we will simply assert a mathematical result to be true, 
leaving its derivation to be found in various textbooks on the subject. For 
this purpose, we recommend Volume I of the textbook Superstring Theory, 
by M.B. Green, J.H. Schwarz, and E. Witten (henceforth to be referred to 
as GSW?). In fact, our initial approach will be very similar to that of GSW, 
and we will continually refer back to this textbook as we proceed. Another 
recommended textbook with a more modern mathematical perspective is 
Introduction to String Theory, by J. Polchinski. Likewise, A First Course 
in String Theory by B. Zwiebach is particularly useful for students who 
may lack a full background in relativistic quantum field theory. 


6.2. Lecture #2: Strings and their spectra: The bosonic 
string 


6.2.1. The action 


We begin by studying the simplest string of all: the bosonic string. As we 
discussed in Lecture #1, the physics of a string is ultimately described by 
the shape it takes (e.g., its vibrational mode of oscillation) as it propagates 
through an external spacetime and thereby sweeps out a two-dimensional 
worldsheet. Therefore, we must first have a way of describing the shape 
of this worldsheet. To this end, we parametrize the worldsheet by two 
worldsheet coordinates (01,02) as illustrated in Fig. 6.6, and describe the 
embedding of this worldsheet into the external spacetime by giving the 
spacetime coordinates X of any location (01,02) on the worldsheet. Thus, 
the physics of the string is ultimately encapsulated in the embedding func- 
tions X"“(o1,02), where u = 0,1,..., D —1. Here D is the total spacetime 
dimension, which we shall keep arbitrary for now. 

Given these embedding functions, we can attempt to write down an ap- 
propriate action for the string. To do this, we first note that as we might 
TNot to be confused with another great GSW trio, namely Glashow, Salam, and Wein- 
berg. One can only hope that someday string theory will be as well-established, both 
theoretically and experimentally, as the GSW electroweak theory. This may sound a 


bit optimistic, but a possible new experimental direction for string theory and string 
phenomenology will be discussed in Lecture #8 in the context of the brane world. 
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Fig. 6.6. The string worldsheet can be parametrized by two worldsheet coordinates 
(01,02). Thus, the location in the external spacetime of any point on the string world- 
sheet is described by a set of functions X(o1, 02). It is convenient to think of c1 as a 
spacelike worldsheet coordinate, and a2 as a timelike worldsheet coordinate. 


expect, strings have tension — i.e., strings generically have a non-zero en- 
ergy per unit length. In other words, it takes energy to stretch a string and 
to give the worldsheet a larger area. Thus, as the string propagates along 
in spacetime, we expect on physical grounds that this string should choose 
a configuration that minimizes the area of the worldsheet. This leads us 
to identify the string action with the area of the corresponding worldsheet. 
Indeed, this results in the so-called Nambu-Goto action, which involves a 
non-trivial square root of the X# coordinates. For certain calculational pur- 
poses, however, this square root is often problematic. Fortunately, however, 
there exists an alternative action, the so-called Polyakov action, which is 
classically equivalent to the Nambu-Goto action but which does not involve 
fractional powers of the X coordinates. This action is given by 


1 


ae ~ Ana 


- [ee VIR guv Oa Xap X” . (6.2.1) 
Here guv is the metric of the external spacetime, hag is the metric of 
the worldsheet, the worldsheet derivative is given by 0g = 0/00%, and 
h = dethag. In the prefactor, a’ is a dimensionful constant (called the 
Regge slope) with units of (length)?. Since these units are equivalent to 
length/energy, we see that a’ is an inverse tension, and indeed the string 
tension T turns out to be related to a’ via T = (27a’)~+. We shall discuss 
the numerical value of a’ below. Note that the action (6.2.1) is manifestly 
spacetime Lorentz-invariant. 
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Before proceeding further, it may be useful to draw an analogy be- 
tween this action and the analogous action for a point particle propagating 
through spacetime and sweeping out a worldline rather than a worldsheet. 
The worldline can be parametrized by a single coordinate ø, which func- 
tions as a proper time along the worldline. The point-particle action can 
then be written in the form 


Spoint particle = z fa (ex Juv Og X" ðs X” — eth?) (6.2.2) 


where 77 is the mass of the point particle and where e(c) is an auxiliary field 
(a so-called einbein). Solving for e(o) through its equation of motion and 
substituting back into (6.2.2) yields an action proportional to the length 
of the worldline and involving a square root. Thus, we see that the string 
action (6.2.1) is nothing but the generalization of the point-particle action 
(6.2.2), where we have associated 


ea) <> h%(o1,02) , m=0. (6.2.3) 


In other words, the string action (6.2.1) is the two-dimensional generaliza- 
tion of the action of a massless point particle, where the worldsheet metric 
functions as an auxiliary field (a “zweibein”). This masslessness property 
will be crucial shortly. 

It is now possible to make some simplifications. Perhaps the most obvi- 
ous is to restrict our attention to a flat spacetime and take guv = nuv. We 
shall do this throughout these lectures. A much more subtle simplification, 
however, is to simplify the worldsheet metric. Let us therefore pause to 
discuss how this can be done. 

One of the first things we realize is that the ultimate physics of the 
string should not depend on the particular choice of coordinate system 
(01,02) on the string worldsheet. After all, on purely physical grounds, 
we know that the particular choice of worldsheet coordinate system cannot 
have a physical effect, for the same worldsheet geometry can ultimately be 
described using an infinite variety of coordinate systems which differ from 
each other through relative reparametrizations or rescalings. (Indeed, in 
the point-particle case, we are likewise free to reparametrize our proper- 
time variable along the particle worldline.) Therefore, the string action 
should have a symmetry that makes it invariant under reparametrizations 
and rescalings of the worldsheet coordinates. Note, in particular, that the 
invariance under rescalings follows from the fact that we chose our string 
action (6.2.1) to generalize that of a massless point particle. In other words, 
we have taken mñ = 0 in (6.2.3). While it is possible to add terms to the 
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action of the bosonic string which mimic the effects of possible mass terms 
and which explicitly break the scale invariance of the bosonic string, we 
shall not need to consider such theories in these lectures. 

The symmetry that comprises both reparametrizations and rescalings of 
the worldsheet coordinates is called conformal symmetry, and the bosonic 
string action (6.2.1) is thus said to be “conformally invariant”. Clearly, 
this symmetry must hold not only at the classical level, but also at the 
quantum level, for we would not have a consistent theory if this symmetry 
were broken by quantum anomalies. Conformal invariance of the action is 
a very powerful physical tool which will play an important role throughout 
these lectures, and indeed the mathematical structure underlying conformal 
symmetry and its implications is a deep and beautiful subject which we 
will not have time or space to discuss here. A recommended starting point 
is Applied Conformal Field Theory (Proceedings of Les Houches, Session 
XLIX, 1988), by P. Ginsparg. Therefore, in order to proceed, we will have 
to make the first of many “great leaps”, and take certain results on faith. 
Our first great leap will therefore be the following: 


Great Leap #1: Conformal invariance of the string 
action allows us to replace the string metric hag with 
the two-dimensional Minkowski metric jag without loss of 
generality. 


This then results in the simplified bosonic string action 


1 


S = - 
4ra 


[ee Oy X" X, . (6.2.4) 


Looking at the action (6.2.4), we see that it has two possible inter- 
pretations. The first interpretation is the one that we have already been 
following: minimizing this action is classically equivalent to minimizing the 
worldsheet area. This follows directly from the interpretation of X” (01, 02) 
as the spacetime coordinates of a given worldsheet position (01,02). Note 
that this action is invariant under SO(D — 1,1) Lorentz transformations 
of the spacetime coordinates, with the index p interpreted as a spacetime 
vector index relative to the Lorentz group. We shall refer to this as the 
spacetime interpretation. 

There is, however, a completely different interpretation of (6.2.4): this 
is the action of a two-dimensional quantum field theory where the two di- 
mensions refer to the worldsheet coordinates and where the “fields” are 
nothing but the functions X“(01, 02), u = 0,1,..., D — 1. Indeed, we see 
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that these spacetime coordinate functions are simply a collection of D dif- 
ferent massless bosonic Klein-Gordon fields which happen to exhibit an 
internal SO(D — 1,1) rotation symmetry (analogous to a gauge symmetry) 
between them. In such a case, the index p is simply an internal symmetry 
index which tells us that the X” fields transform as vectors with respect 
to the internal SO(D — 1,1) symmetry. We shall refer to this as the world- 
sheet interpretation. Indeed, it is because this string action contains only 
bosonic worldsheet fields that we call this the bosonic string. In such a 
description, spacetime is not a fundamental concept but rather a “derived” 
concept: it results from the interpretation of various worldsheet fields as 
spacetime coordinates, and from the interpretation of an internal symmetry 
as a spacetime Lorentz symmetry. It is indeed remarkable that such differ- 
ent interpretations can be made of the same physics, and we shall often go 
back and forth between these different worldsheet and spacetime points of 
view. 

Given these two descriptions of the action, we can also understand the 
origin of the Regge slope parameter a’ on dimensional grounds. Let us first 
take the worldsheet point of view, so that our length dimensions are de- 
termined with respect to the coordinates (01,702). In such a case, we know 
that the ordinary Klein-Gordon action does not require any dimensionful 
prefactor, for f d?0(0.X“)? is indeed dimensionless when the Klein-Gordon 
field X” is itself dimensionless. However, from the spacetime point of view, 
we see that X” cannot be dimensionless, for we ultimately need to inter- 
pret this field as a spacetime coordinate with units of length. Thus, we 
are forced to compensate by inserting a dimensionful prefactor a’ in front 
of the action. In other words, the need for the dimensionful prefactor a’ 
arises from the need to interpret our dimensionless (scale-free) worldsheet 
theory as a dimensionful (spacetime) theory. Or, to put it slightly differ- 
ently, the parameter a’ is the dimensionful conversion factor that describes 
the overall scale of the embedding of the dimensionless worldsheet physics 
into the dimensionful spacetime. We shall see this phenomenon very often 
throughout these lectures: the worldsheet physics is by itself scale-invariant 
(since it generalizes the physics of a massless point particle with 7m = 0), 
and it is only in the conversion to dimensionful spacetime quantities that 
the overall scale a’ plays a role. Thus, a’ sets the overall spacetime mass 
scale of string theory, often called the string scale: 


eR 


Mstring = : (6.2.5) 


A 
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A priori, this mass scale is unfixed, but we shall see shortly how this scale 
is ultimately determined. 

Now that we have established the worldsheet picture and the space- 
time picture, it is easy to see how they are related to each other: each 
quantum excitation of the Klein-Gordon worldsheet fields X” corresponds 
to a different particle in spacetime. Thus, the study of string theory can 
be reduced to the study of a two-dimensional quantum field theory! For 
example, particle scattering amplitudes in spacetime can be re-interpreted 
as the correlation functions of our two-dimensional worldsheet fields, eval- 
uated on various two-dimensional manifolds. Of course, as we have stated 
above, this is not just any two-dimensional quantum field theory, for phys- 


ical consistency also requires the presence of conformal symmetry. Thus, 
from this point of view, string theory is the study of two-dimensional con- 
formal field theories. In two dimensions, it turns out conformal symmetry 
is extremely powerful, for it gives rise to an infinite number of conserved 
currents. Indeed, two-dimensional conformal symmetry is often sufficiently 
powerful to permit the exact evaluation for many scattering amplitudes. 

In the case in question, the particular conformal field theory that con- 
cerns us is that of D free massless bosonic fields X#, u = 0,1,...,D—1. 
However, just as with any symmetry, there is always the danger of quantum 
anomalies. Nevertheless, it is straightforward to show that 


Great Leap #2: Conformal invariance of the string 
action is preserved at the quantum level (i.e., all quantum 
anomalies are cancelled) if and only if D = 26. 


This is clearly a big result, and we will not have space to provide a 
proper mathematical derivation of this fact. At the very least, however, we 
can give a guide as to the most useful way of thinking about this result. 
Note that our D bosonic fields are identical to each other and essentially 
decoupled from each other. Therefore, each contributes the same amount 
to any potential anomaly. This amount is called the central charge, and the 
central charge c of each bosonic field X will be denoted cx. It turns out 
that cx = 1, and therefore the total central charge from the D bosonic 
fields is Cfeas = D. However, it can be shown that there also exists 
a “background” central charge (7.e., a background quantum anomaly) of 
magnitude Chackground = —26. Thus, the total anomaly is cancelled only 
if D = 26. Clearly, the most mysterious part of this discussion is the ori- 
gin of this “background” central charge. In technical terms, it reflects the 
contributions of the conformal ghosts that arose when we used the confor- 
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mal symmetry to set (or “gauge-fix”) the worldsheet metric hag > Nag. 
However, all we will need to know for the future is that the value of the 
“background” anomaly Cpackground depends on only the particular symme- 
try of the worldsheet action that we are dealing with. In the present case, 
this worldsheet symmetry is simply conformal invariance, and the corre- 
sponding background central charge corresponding to conformal invariance 
iS Chackground = —26. Therefore, we see that the total conformal anomaly 
is cancelled only if D = 26. This is typically called the critical dimension 
of the bosonic string. 

We see, then, that string theory is able to determine the spacetime di- 
mension as the result of an anomaly cancellation argument! It is worth re- 
flecting on how this happened by considering an analogous situation in field 
theory, namely the cancellation of the triangle axial anomaly. We know that 
this anomaly is cancelled only for very particular combinations of particle 
representations (e.g., we require complete generations of Standard-Model 
fields, with three colors of quark for every lepton). So we are used to the 
idea that anomalies are extremely sensitive to the field content of the the- 
ory. In string theory, however, we have seen that the analogous worldsheet 
field content is parametrized by the spacetime dimension. More worldsheeet 
fields correspond to more spacetime dimensions. Therefore, just as triangle 
anomaly cancellation requires three colors, conformal anomaly cancellation 
requires 26 dimensions. 

Of course, our world does not consist of 26 flat spacetime dimensions, 
and we shall ultimately need to find a way of reducing this to a four- 
dimensional theory. For now, however, we can just think of the present 
bosonic string as a 26-dimensional toy model. 


6.2.2. Quantizing the bosonic string 


Let us now quantize this theory. Having already noted that the action 
(6.2.4) is nothing but the action of a set of 26 Klein-Gordon fields X“, we 
already know how to proceed: in the usual fashion, we introduce a Fourier- 
expansion of the fields X“, and interpret the coefficients of this expan- 
sion as creation and annihilation operators obeying canonical quantization 
relations. 

Because we ultimately wish to interpret the fields X” as spacetime co- 
ordinates, we must first impose the constraint 


X"(o, + 1,02) = X"(01, 02) (6.2.6) 
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where we have chosen to normalize the length of the closed string as m. In 
other words, the spacetime coordinates must be single-valued as we make 
one complete circuit around the closed string. This is the first place where 
we have essentially incorporated the requirement that we are dealing with 
closed strings whose topology is that of a circle. Moreover, because of this 
topology (and because of the linear nature of the wave equation resulting 
from the action (6.2.4)), we know that we can also decompose any possible 
quantum excitation of the wiggling string into a superposition of modes 
that travel clockwise around the string (in the direction of, say, decreasing 
cı) and those that travel counter-clockwise (in the direction of increasing 
cı). These are respectively called left-movers and right-movers. We can 
therefore decompose each of our Klein-Gordon fields into the form 


X" (01,02) = XE (o1 +02) + XR — 02) ; (6.2.7) 


The most general mode-expansion consistent with the boundary condition 
(6.2.6) is then 


X" (01,02) = ah +p” O2+t= | Bg eur haa) ae On et?in(oi—o2) ; 
n 


n#0 
(6.2.8) 
which decomposes into 
u Lop 4 u t ah, —2in(o1+02) 
Xt (o +02) = zT + oP (o1 + 02) + xo Pa 
nÆ0 
g a ar in(a1.—o 
Xlo — 02) = $a” — zoa — 02) + OD i (o1=02) (6.2.9) 
nÆ0 


Here / = V2a’ is a fundamental length that has been inserted on dimen- 
sional grounds. 

It is easy to interpret the different terms in (6.2.8) and (6.2.9). Clearly 
the final terms in each line represent the internal quantum vibrational oscil- 
lations of the string, where a% and â} are the left-moving and right-moving 
creation/annihilation operators corresponding to vibrational modes of a 
given frequency n. We shall discuss these operators shortly. Note that 
the contribution from the “zero-mode” has been separated out and written 
explicitly in the form x! + $£?(o1 + 02) for the left- and right-movers re- 
spectively. In the case when there are no quantum excitations (so that we 
can ignore the final exponential terms), these “zero-modes” are all that re- 
main of the mode-expansion, whereupon we see from (6.2.8) that the total 
X" field takes the form X” = z” + ¢?pa2. Interpreting c2 as the timelike 
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coordinate on the string worldsheet, we thus see that x” is nothing but the 
center-of-mass position of the string, and p” its center-of-mass momentum. 

Let us now consider the quantization rules that we must impose. The 
first one (for the zero-modes) is easy: we simply impose the usual commu- 
tation relation |z”, p”] = ihn"”. We shall henceforth set h = 1. The excited 
modes also have a similar commutation relation. First, note that because 
the X fields are interpreted as spacetime coordinates, they are necessarily 
real. This implies that we must identify a” „ = (a#)', with a similar result 
for the right-moving oscillator modes. In other words, the negative modes 
create excitations, while the positive modes annihilate the same excitations. 
Given this, we then can immediately write down the commutation relation 
for the creation/annihilation operators: 


[at ay] = Mobminnh” , [ar AY] = Mominn” . (6.2.10) 


Here we have introduced the notation 6; = 62,9 = 1 if x = 0, and = 0 if 
x #0. Note that these are exactly the harmonic oscillator commutation 
relations, except that we have rescaled each mode a, by its corresponding 
frequency n in (6.2.9). Thus, an = @,/./n obey the usual harmonic os- 
cillator commutation relations. This rescaling has become conventional in 
string theory, and we shall retain it here. Likewise, it is often conventional 
to define the zero-mode af = ivoa pë. 

Given this mode-expansion, we can now construct the corresponding 
number operators 


1 ~ 1 
0: N, = >a”, Qn N, = —G, an 6.2.11 
n > rn QA nnu ; = QX nAnu ( ) 


which count the number of excitations of the nt! frequency modes of the 
string. Once again, this is completely analogous to the harmonic-oscillator 
creation/annihilation modes, after we take into account the rescaling a, = 
VNan and the hermiticity condition a_y = a}. 

Likewise, we can also write down the total energy of the system. To do 
this, let us consider the different contributions to the total energy. First, 
there is the energy associated with the internal quantum vibrational oscil- 
lations of the string. As we might expect, this is given by 


co co 

(osc) _ z u 

Lo = > nNn = > On Any 
n=1 n=1 


LE = Sonn = So at any - (6.2.12) 
n=1 n=1 
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For convenience, we are defining these energy operators in such a way that 
they are dimensionless numbers (i.e., they are worldsheet energies). These 
Lo operators are often called Virasoro generators, which are more generally 
defined Lm = >>, af, Qnp- These generators are nothing but the different 
frequency modes of the total worldsheet stress-energy tensor, and together 
they satisfy the so-called Virasoro algebra. We shall only consider Lo in 
these lectures. 

Next, there is the energy of the zero-modes, which correspond to the 
net center-of-mass motion of the string. This is given by 


al 


Le”) = ata, = gr Pe 
/ 
Ee = aH Go, = gr Pe . (6.2.13) 


Note that factors of a’ must appear in order to counter-balance the fact 
that the center-of-mass momentum p” is a spacetime quantity, and hence 
dimensionful. 

Finally, there is the possibility of an overall non-zero vacuum energy 
for both the left-movers and the right-movers. In other words, there is no 
reason to assume that the vacuum state (the state without any excitations) 
is exactly at zero energy. This is important, of course, since string theory 
is ultimately a theory which will contain gravity, and it is precisely in the- 
ories containing gravity that the overall zero of energy becomes important. 
Indeed, mathematically, one can imagine that due to the commutation re- 
lations (6.2.10), there can be an overall normal-ordering ambiguity in the 
definitions in (6.2.12), and this overall normal-ordering constant would be 
our “vacuum energy”. 

Thus, denoting the left- and right-moving vacuum energies as az, R, we 
have the total left- and right-moving energies 


H = LLL) a, E SEO LO tar. (6.2.14) 


These are the total worldsheet Hamiltonians. 

Clearly, the important thing to do at this stage is to determine the 
vacuum energies az, r. Of course, the symmetry between left-movers and 
right-movers requires ag = ar. Calculating this vacuum energy can be 
done in numerous ways, each of which would take too much space for our 
purposes. Once again, we refer the reader to Chapter 2 of GSW, where a 
full calculation is given. Therefore, it is time for another 
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Great Leap #3: Conformal invariance of the string 
action implies that az = ar = —1. 


Finally, in order to determine the total spacetime mass of a given string 
state, we must have a mass-shell condition for the string. Rather than 
provide a rigorous derivation (for which we again refer the curious reader 
to GSW), we can instead give an intuitive argument which suggests the 
proper answer. In a quantum field theory of point particles, the mass ™ is 
a parameter that appears in the Lagrangian through an explicit mass term 
that might be generated in some separate manner, e.g., through the Higgs 
mechanism. Since a point particle has no internal degrees of freedom beyond 
those associated with its center-of-mass motion, such a mass parameter m 
would then be directly identified with M, the resulting physical mass of the 
particle. Such a physical mass M is the quantity satisfying the condition 
pY py, = —M?, or equivalently the condition no = Lee) = —a' M? /4. 
In the special case of a massless particle (for which îm = M = 0), this 
mass-shell condition then takes the simple form Lo = Lon =0. 

A similar condition emerges in string theory. We have already seen that 
our string action (6.2.1) generalizes that of a massless particle, which again 
suggests that our effective Lagrangian mass parameter m vanishes. Indeed, 
as we have discussed, this is the root of the scale invariance of the string 
action (6.2.1). However, unlike the point-particle case, a string does have 
additional, purely internal degrees of freedom — these are the oscillations 
of the string itself, whose additional energy contributions are represented 
by LE, LEO, and az, r. Thus, even though m = 0, the resulting string 
state can still have a non-zero physical mass M in spacetime. Indeed, 
just as the mass-shell condition for massless point particles is given by 
LE = toe = 0, the mass-shell condition for our scale-invariant string 
is generalized to H = H = 0. This then becomes our scale-invariant mass- 
shell condition in string theory. Of course, spacetime Lorentz invariance 
still allows us to identify the physical spacetime mass M of a given string 
state via the relations oe = Tm = —a'M?/4. Thus, the string 
mass-shell conditions H = H = 0 lead to the identifications 
1 


Za M? = I 1., (6.2.15) 


1 OSC 
Za M? = LO) 1, ; & 


4 


Note that these two conditions can also be written in the form 


aM? = 2 (ay pS 2) (6.2.16) 
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where we must obey the constraint 
LO ee Te, (6.2.17) 


Interpreting the conditions (6.2.16) and (6.2.17) is easy. The condition 
(6.2.16) simply tells us that the physical spacetime mass M of a given string 
state (and thus the square of its center-of-mass momentum) is generated 
solely from its internal left- and right-moving vibrational excitations. The 
condition (6.2.17), by contrast, tells us that the mass of the string must 
come equally from left-moving and right-moving excitations. The latter 
condition (6.2.17) is often referred to as the level-matching condition, since 
it implies that a given string oscillator state is considered to be “on shell” 
(or “physical” ) only if the total excitation level of the left-movers matches 
the total excitation level of the right-movers. This condition implies that 
the string does not have an unbalanced “wobbling”, for if such a wobbling 
existed, it could ultimately be used to determine a preferred coordinate 
system on the worldsheet (thereby breaking conformal invariance). Indeed, 
demanding invariance under shifts in the a; variable leads directly to the 
condition (6.2.17). We remark, however, that states not satisfying (6.2.17) 
are nevertheless important for understanding the “off-shell” or “virtual” 
structure of string theory. Such “virtual” states contribute, for example, 
within loop amplitudes. In these lectures, however, we shall focus on only 
the so-called “tree-level” string spectrum for which the level-matching con- 
straint (6.2.17) is imposed and the corresponding physical masses are given 
by (6.2.16). 


6.2.3. The spectrum of the bosonic string 


Having discussed the quantization of the bosonic string, we can now ex- 
amine its spectrum. The procedure is simple: we simply consider all pos- 
sible combinations of left- and right-moving mode excitations of the string 
worldsheet, subject to the level-matching constraint (6.2.17), and then we 
tensor these left- and right-moving states together to form the total re- 
sulting string state. The spacetime mass of this string state is then given 
by (6.2.16), and the properties of the state are deduced directly from the 
underlying vibrational configuration of the string. 
The simplest state, of course, is the string vacuum state 


10) R ® l0); (6.2.18) 


in which the right- and left-moving vacuum states are tensored together. 
This state trivially satisfies (6.2.17), which indicates that this state is indeed 
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part of the physical string spectrum. Unfortunately, we see from (6.2.16) 
that this state has a negative squared mass — i.e., the spacetime mass 
of this state is imaginary! This state is thus a tachyon. Making sense of 
this string state is problematic, and is one of the reasons that we shall not 
ultimately be interested in the bosonic string. 

Let us continue, however. The first excited string state is 


àt R ® a%4|0), . (6.2.19) 


This state has LE = LO = 1, and according to (6.2.16) is therefore 
massless. As evident from its Lorentz index structure, this state transforms 
under the spacetime Lorentz group as the tensor product of two spin-one 
Lorentz vectors. We can therefore decompose this tensor product into a 
spin-two state (the symmetric traceless component), a spin-one state (the 
antisymmetric component), and a spin-zero state (the trace). Mathemat- 
ically, this is equivalent to the tensor-product rule for Lorentz transverse 
SO(24) vector representations: 


V21®Vna = 1 © 276 & 299 (6.2.20) 


where Vs is the eight-dimensional vector representation, and where the 1 
representation is the spin-zero state, the 276 representation is the spin-one 
state, and the 299 representation is the spin-two state. 

How can we interpret these states? A massless spin-two state must, by 
Lorentz invariance, have equations of motion which are equivalent to the 
Einstein field equations of general relativity. Thus, we are forced to iden- 
tify the spin-two (traceless symmetric) component of the state (6.2.19) as 
the graviton guv, which is the spin-two mediator of the gravitational inter- 
actions. The spin-one (antisymmetric) state within (6.2.19) is an antisym- 
metric tensor field, often denoted B,,,, and the spin-zero (trace) component 
is the so-called dilaton, denoted ¢. Together, (gy, Byv,) are called the 
gravity multiplet. 

By identifying (6.2.19) with the gravity multiplet, we see that string 
theory becomes a theory that contains gravity! This in turn allows us to 
determine the value of our previously unfixed mass scale a’. We shall now 
sketch how this happens (with details available in GSW). It turns out that if 
one calculates loop amplitudes in string theory, one finds that e~® serves as 
a loop expansion parameter (i.e., higher-loop amplitudes come multiplied 
by more powers of e~®). Given this observation, it is natural to identify the 
string coupling constant as the vacuum expectation value of the dilaton: 


Jstring = e~‘) : 6.2.21 
8 
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This string coupling constant describes the strength of string interactions. 
Given this definition, we then find that the graviton state couples to matter 
with the expected gravitational strength only if we choose 


G 
a’ = — wen (6.2.22) 
Istring 
where Gnewton is Newton’s constant. Substituting this result into (6.2.5), 
we then find 


Mstring = Qstring Mpianck s (6.2.23) 


where Mpianck = 1/WGNewton. Thus, because it contains gravity, string 
theory becomes a theory whose fundamental mass scale is related to the 
Planck scale. 

We can also construct more and more massive string states. Ultimately, 
these fill out an infinite tower of string states. It is clear that such additional 
states all have a’M? > 0. Given the above value for a’, this implies that 
these additional states all have Planck-scale masses. Such Planck-scale 
excited states are therefore not of direct relevance for string phenomenology. 
Let us note, however, one interesting fact about these states. For any given 
spacetime mass level M, the string state with maximum spin is achieved 
by exciting only the lowest vibrational modes a", and a@“,. We thus find 
that for a given spacetime mass M, the maximum spin Jmax that can be 
realized is 


a'M? = 2JImax—4. (6.2.24) 


For example, we see that the maximum spin that can be realized for a 
massless state is J = 2 (the graviton). The relation (6.2.24) was originally 
observed for hadron resonances, and historically gave rise to the so-called 
“dual resonance models” (which eventually became modern string theory). 
In such dual resonance models, the relation (6.2.24) describes a so-called 
“Regge trajectory”, with a’ serving as the so-called “Regge slope”. It is for 
this reason that in modern string theory, we continue to refer to a’ as the 
Regge slope. 

Before concluding, let us briefly mention one further important issue. In 
ordinary four-dimensional quantum field theory, we know that a massless 
spin-one state (e.g., a photon) naively has four distinct states (correspond- 
ing to the four components of a vector field A“). However, the underlying 
gauge invariance allows us to make a unitary gauge choice wherein only 
two of these states (the two helicity states) are truly physical. The timelike 
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and longitudinal states decouple, leaving only the transverse components. 
In the above description of the string spectrum, however, we have taken 
a covariant approach analogous to the description of a photon as a four- 
component vector. One might then wonder which of these states are truly 
physical. This issue is an important one in string theory, and once again 
we cannot here provide a proper proof. We shall therefore make recourse 
to another 


Great Leap #4: The physical string states are those 
which are realized by exciting the oscillator modes of only 
the transverse coordinates XŻ (i = 1,...,24). 


Proving this statement requires showing that even after we have used con- 
formal invariance to set the string worldsheet metric to nag, there still 
remains sufficient freedom to make a further “gauge” choice wherein we set 
the oscillator modes of the timelike and longitudinal spacetime coordinates 
to zero. This gauge choice, which is called light-cone gauge, is thus the 
analogue of unitary gauge in quantum field theory, and essentially tells us 
that only the 24 transverse coordinates correspond to physical degrees of 
freedom in the string worldsheet action. An important by-product of this 
fact is that every remaining string state has a non-negative norm. This 
is non-trivial. For example, if our metric signature is chosen such that 
n% = —1, then the state a“—°|0) has a negative norm. However, one can 
demonstrate that in light-cone gauge all resulting states are physical and 
have non-negative norm. 


6.2.4. Summary 


Let us quickly review those features of the bosonic string that we shall need 
to bear in mind in subsequent lectures. We shall separate these features 
into worldsheet features and spacetime features. 

Worldsheet: The worldsheet fields consist of D copies of the left- and 
right-moving spacetime coordinates X¥ and X} (the worldsheet bosons). 
The fact that these X coordinates are periodic as we traverse the closed 
string loop implies that they have integer modings a, and Q,, where n € Z. 
The relevant worldsheet symmetry is conformal invariance, which tells us 
that the number of these X” fields is D = 26 and also tells us that the 


vacuum energy corresponding to these fields is az = ar = —1. As we have 
stated above, a useful way to think about these results is to imagine that 
there is a “background” conformal anomaly Chackground = —26, and that 


each X” field makes a contribution cx, = 1. In general, the “background” 
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conformal anomaly is only a function of the relevant worldsheet symmetry 
(in this case conformal invariance), and it will always remain true that 
cx = 1. Thus, cancellation of the conformal anomaly requires D = 26. 
A similar interpretation can also be given to the vacuum energy. When 
calculating the vacuum energies, only the physical (i.e., transverse) fields 
are relevant. It is a general result that each X field contributes ax = —1/24 
to the vacuum energy. Therefore, we find a, = ar = 24ax = —1. 

Spacetime: The above worldsheet theory leads to the following features 
in spacetime. We find that the spacetime dimension (often called the critical 
spacetime dimension) is 26. The spectrum consists of a spinless tachyon, 
as well as a massless gravity multiplet consisting of the graviton guv, the 
antisymmetric tensor B,,, and the dilaton ¢. There is also an infinite tower 
of massive (Planck-scale) string states. 

Comments: Two remarkable things have happened. First, we have a 
theory of quantized gravity! The graviton has emerged as the quantum 
excitation of a closed string. This alone is very exciting, but also somewhat 
mysterious. We started by assuming a closed string propagating through 
an external, fixed, flat spacetime. But this string itself includes a graviton 
mode, which implies a distortion in that background spacetime. This then 
acts back to change the worldsheet theory. Thus, in some sense, the string 
itself not only “creates” the spacetime in which it propagates, but is then 
affected by this change in the spacetime geometry. This coupling or inter- 
play between the string and its spacetime is not fully understood, and is 
clearly at the heart of the many mysterious features of string theory as a 
theory of quantum gravity. 

A second remarkable thing has also happened, although we have not 
demonstrated it explicitly. As indicated in (6.2.21), a coupling constant 
has been determined not as a free parameter, but rather dynamically as 
the vacuum expectation of a string field. It is in this sense that string 
theory contains no free parameters, and that all parameters such as coupling 
constants are determined dynamically. 

There are, however, a number of drawbacks to this bosonic string the- 
ory. First, it contains a tachyonic state. We must somehow find a way 
to eliminate this. Second, all string excitations are spacetime bosons (i.e., 
they have integer spin). We must find a way to obtain spacetime fermions. 
Third, there are no massless spin-one states (which we would wish to as- 
sociate with gauge fields). Thus, there are no gauge symmetries. It is 
for these reasons that we shall go on to consider more complicated string 
theories. 
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And finally, there is another major drawback that we need to be aware 
of. Although it is compelling that the string coupling gstring is in principle 
determined dynamically, as the vacuum expectation value of the dilaton 
scalar field, in practice we do not understand how to calculate the poten- 
tial of the dilaton field and thereby deduce its vacuum expectation value. 
In the bosonic string we are considering here, the dilaton potential V(¢) is 
actually divergent for all ¢ < oo, and so this question cannot be meaning- 
fully addressed. However, even in the more realistic string theories to be 
discussed, this potential is either completely flat (as happens in a super- 
symmetric context), or generally takes a shape that sends (¢) — oo. This 
is the famous dilaton runaway problem. Solving this problem is perhaps 
one of the most important (unsolved) problems in string phenomenology. 

How can we remedy these features? One possibility is prompted by the 
appearance of the tachyon. In ordinary quantum field theory, the existence 
of a tachyon (a state with a negative mass-squared) signals that the vacuum 
has been misidentified (as in the Higgs mechanism); the theory then “rolls” 
to a different vacuum configuration in which the tachyon is eliminated. So 
it is natural to speculate that perhaps the bosonic string theory also “rolls” 
to a new vacuum in such a way that the tachyon is no longer present and the 
dilaton is stabilized. Perhaps fermions and gauge fields might also appear 
in this new vacuum, as desired. However, as we have already indicated, it 
is not known how the bosonic string behaves in this context. We do not 
know if there exists a new (“stable”) vacuum to roll to, and if so, what 
its properties might be. Of course, knowing the potential V(¢) would be 
extremely useful, yet as we indicated this potential is naively divergent 


and therefore requires some knowledge of the non-perturbative structure of 
string theory. So (at least for the time being) this option does not appear 
promising. 

A second possibility, then, is simply to abandon the bosonic string and 
attempt to construct a new string theory altogether. And this is what we 
shall now do. 


6.3. Lecture #3: Neutrinos are fermions: The superstring 


As we saw in the last lecture, the bosonic string has two glaring failures: it 
contains a tachyon, and it does not give rise to spacetime fermions. Both 
of these features are troubling, especially since the announced goal of these 
lectures is to derive a neutrino from string theory, and we know that the 
neutrino is a fermionic object. We therefore seek to construct a new string 
theory which can give rise to excitations with half-integer spins. 
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6.3.1. The action 


We have already seen that string theories are defined by their two- 
dimensional worldsheet actions. Thus, in order to construct a new string 
theory, we must construct a new worldsheet action. At the very least, this 
action should contain that of the bosonic string, since we still wish to re- 
tain the spacetime interpretatation that we had previously. Thus, our only 
option is to introduce additional worldsheet fields into the action: 


1 


Ara! 


S=- 


[ee (3a XOX, +). (6.3.1) 


What new fields can we add? If our goal is to produce spacetime 
fermions, a natural guess would be to add worldsheet fermions! These would 
complement the worldsheet bosonic fields X” that are already present. For 
the moment, let us denote such fermionic fields schematically as ~. We 
would then attempt to consider an action of the form 


s=- 


1 Q Te 
a [ee (O.X"O X, + wip da) 3 (6.3.2) 


Here ¢)(01, 02) represents our two-dimensional fermionic fields, and p® are 
an appropriate set of two-dimensional Dirac matrices (the analogues of the 
y” matrices in four dimensions). 

We then face a number of questions. First, how many w fields must we 
add? Second, what kinds of worldsheet fermions should these be? Should 
they be Dirac fermions, or Majorana fermions, or Majorana- Weyl] fermions? 
Third, how should these two-dimensional spinors w transform under the 
(internal) SO(D — 1,1) spacetime Lorentz symmetry? We already know 
that the X” fields, for example, transform as vectors under this symmetry. 
Note that it is not obvious that the ~ fields should necessarily transform 
as spinors under SO(D — 1,1) and carry a spacetime spinor index. In 
particular, all we know thus far is that the w fields transform as spinors un- 
der worldsheet two-dimensional Lorentz transformations. This does not 
a priort give us any information about their spacetime transformation 
properties. 

There is also another potential worry that appears if we try to add 
new worldsheet fields. We have already seen in the bosonic string that 
worldsheet conformal invariance was sufficiently powerful a symmetry to 
allow us to choose a light-cone gauge and thereby eliminate all negative- 
norm states. However, the presence of new worldsheet fields implies the 
existence of new quantum excitation modes in the resulting string spectrum, 
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and some of these new states may also have negative norm. Thus, conformal 
symmetry may no longer be sufficient (and indeed would not be sufficient) 
to allow us to eliminate these states as well. 

It turns out that all of these questions have a common answer: we can 
impose an extra symmetry beyond simple worldsheet conformal invariance. 
Indeed, the extra symmetry that we shall impose is nothing but worldsheet 
(i.e., two-dimensional) supersymmetry. Specifically, we shall require that 
the w fields be the two-dimensional superpartners of the X fields, so that 
the resulting action has a manifest worldsheet (two-dimensional) supersym- 
metry.* This new theory will be called the superstring. 

It is important to stress that this supersymmetry that we will be dis- 
cussing is not the spacetime supersymmetry that might be seen in the 
next round of accelerator experiments. Instead, this is a worldsheet super- 
symmetry which stems directly from the worldsheet interpretation of the 
original Polyakov action (6.2.4), and which relates the worldsheet bosons 
X to worldsheet fermions w via a worldsheet supercurrent J. 

Imposing this worldsheet supersymmetry then answers all of the ques- 
tions we previously raised. How many w fields? The answer is D, one 
for each boson X”. What kind of w spinor? The answer is a Majo- 
rana (two-component) spinor. How does the w field transform under the 
SO(D — 1,1) spacetime Lorentz symmetry? The answer is that the 7 field 
must transform as a vector under the Lorentz symmetry, since the X” field 
(for which it is the worldsheet superpartner) also transforms as a vector. In 
other words, the worldsheet supersymmetry commutes with the spacetime 
Lorentz symmetry, and thus does not change the Lorentz index structure. 


*We remark that this is only one possible choice, and will ultimately lead us to the so- 
called Ramond/Neveu-Schwarz (RNS) formalism. Another possible choice would be to 
demand spacetime supersymmetry, and to imagine that the ~w fields are the Grassmann 
coordinates 0 of a super-spacetime. This possibility would then lead to the so-called 
Green-Schwarz (GS) formalism. It turns out that these two formalisms are ultimately 
equivalent, however, and both provide suitable descriptions of the resulting superstring 
theory. This equivalence is possible because the RNS superstring ultimately also has 
spacetime supersymmetry (as we shall discover below). In these lectures, however, we 
shall restrict our attention to the RNS formulation in which the ~ fields are worldsheet 
(rather than spacetime) superpartners of the X”# fields. Aside from being more useful for 
string phenomenology, the RNS formalism has the philosophical advantage that it treats 
the string as the fundamental object, with the spacetime structure emerging as a derived 
consequence. The RNS formalism thus reinforces one of the central themes of these 
lectures, namely that we define a string theory by its worldsheet properties alone, and 
then deduce the spacetime effects of these properties as consequences. The GS formalism, 
on the other hand, has the benefit of being manifestly spacetime supersymmetric from 
the very beginning. 
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Thus, the w fields transform as spacetime vectors, and carry a spacetime 
vector index: Y” (01,02). 

This last point may initially seem confusing, so we reiterate: the w 
fields are worldsheet fermions, but spacetime bosons! They transform as 
spinors under worldsheet Lorentz transformations, but as vectors under the 
spacetime Lorentz transformations. 

Given this, we can now explicitly write down the superstring action: 


1 


4ra’ 


S = 


I Bo (aX 3X, — ibpp*dap") . (6.3.3) 


Our worldsheet fields are X” (01, 02) and ~"(o1, 02), and the pu index (with 
u = 0,1,2,..., D—1) is a vector index with respect to the internal symmetry 
SO(D — 1,1). From the worldsheet perspective, each X” is a scalar field 
(containing one component), while each ~ is a two-component spinor. The 
p“ are two-dimensional Dirac matrices satisfying the two-dimensional Clif- 
ford algebra {p%, pf} = —2n%?, and Y = wi p®. One can then show that the 
action (6.3.3) is invariant under the worldsheet supersymmetry transforma- 
tions 6X" = ew, dv = —ip%e0,X", where € is a constant anticommuting 
spinor that parametrizes the “magnitude” of the supersymmetry transfor- 
mation. The corresponding generator of this worldsheet supersymmetry 
transformation is the worldsheet supercurrent Ja = 4 pf Pat" OGX p- 

It is convenient to choose a particular Weyl (chiral) representation for 
the two-dimensional p“ matrices: 


0 -i 0 i 1 0 
0 1 O01 
g a ae R a a RER z Eve ce) 


Here the product p?p! plays the role of the chirality operator (the analogue 
of ys in four dimensions), and thus in this basis we can identify the upper 
and lower components of the two-component Majorana spinor p as being 
left-moving and right-moving respectively. Our worldsheet action (6.3.3) 
then decomposes into the form 


S= 2 [ee (Og X"#O°X, — VprO- Yh — Yur ðt) (6.3.5) 


4ra’ 


where Ox are derivatives with respect to the left- and right-moving world- 
sheet coordinates o1 + o2. The worldsheet content of this theory there- 
fore consists of D left-moving worldsheet bosons Xz, D right-moving 
worldsheet bosons Xg, D left-moving worldsheet Majorana-Weyl (one- 
component) fermions Yz, and D right-moving worldsheet Majorana-Weyl 
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(one-component) fermions Yr. There are two worldsheet supercurrents in 
this theory: 


Jt = pur 04 Xi, Jr = Yurô- X$. (6.3.6) 


Note that our original goal in constructing the superstring had been to 
obtain spacetime fermions. However, it may seem from the above that we 
have failed in this regard, since we have only introduced new fields w which 
themselves are spacetime vectors. How then are we to obtain spacetime 
fermions? It turns out that this will happen in a surprising way. 

Let us proceed to analyze this string following the same steps as we 
used for the bosonic string. First, we see that our worldsheet symmetry 
has been enlarged: rather than simply have conformal invariance, we now 
have conformal invariance plus worldsheet supersymmetry. Together, this 
is called superconformal invariance, which is a much larger symmetry than 
conformal invariance alone. 

This enlargement of the worldsheet symmetry changes many of the fea- 
tures of the resulting string. The most profound is the value of the space- 
time dimension D. Recall from our discussion of the bosonic string that as- 
sociated with each worldsheet symmetry there is a particular “background” 
conformal (central charge) anomaly, and that it is necessary to choose a suf- 
ficient number of worldsheet fields so as to cancel this anomaly and ensure 
that conformal invariance is maintained even at the quantum level. The 
same argument applies here as well, except that 


Great Leap #5: The “background” conformal anomaly 
associated with superconformal invariance is not c = —26 
but rather c = —15. Likewise, the conformal anomaly 
contribution from each worldsheet Majorana fermion is 
c= 1/2. 


We can understand the origin of the “background” conformal anomaly c = 
—15 as follows. Just as in the bosonic string, a certain contribution c = 
—26 is attributable to the conformal ghosts resulting from conformal gauge 
fixing. The new feature here is that we now have an additional contribution 
+11 which is attributable to the worldsheet superpartners of these ghosts. 
Together, this produces a background anomaly c = —15. What this means 
is that we must choose the number D of worldsheet bosons and fermions 
such that this “background” anomaly is cancelled. We have already seen 
that the anomaly contribution from each worldsheet boson X” is cx = 1. 
Since the anomaly contribution from each Majorana fermion is cy = 1/2, 
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we must satisfy 


D(i+4)-15=0 D=10. (6.3.7) 


Thus, we see that the critical dimension of the superstring is D = 10 rather 
than D = 26. Moreover, just as for the bosonic string, the superconformal 
symmetry of the superstring worldsheet action again allows us to choose a 
light-cone gauge in which only eight transverse bosons and eight transverse 
fermions represent the truly physical propagating worldsheet fields. 


6.3.2. Quantizing the superstring 


Let us now quantize the superstring, just as we did for the bosonic string. 
The boundary conditions (6.2.6) for the X” fields remain valid even for the 
superstring, since the X“ continue to have the interpretation of spacetime 
coordinates. Therefore the mode-expansions (6.2.9) continue to apply. 
The only new feature, then, is the mode-expansion for the fermionic 
fields Y”. However, unlike the bosonic fields X” which must be periodic 
because of their interpretation as spacetime coordinates, these fermionic 
fields %” do not have any immediate interpretation in spacetime. Therefore, 
the only boundary conditions that might be imposed on these fields are 
those that are required directly from the symmetries of the action. In 
particular, we must choose boundary conditions for the Y” fields so as to 
maintain the single-valuedness of the action as we traverse the closed string 
(i.e., as 01 > 01 +77), and so as to maintain the worldsheet supersymmetry 
of the action (whose algebra includes a requirement that the supercurrent 
square to the Hamiltonian, i.e., J: J ~ H). It turns out that are only 
two choices of boundary conditions that satisfy these requirements. One 
possibility is that the ~ fields are periodic under 0, — 0, + T: 


Ramond: ~y"(o,+7,02) = + W"(o1,02) . (6.3.8) 


Such periodic boundary conditions are typically called “Ramond” (R) 
boundary conditions, after P. Ramond (who introduced these fermionic 
boundary conditions in 1971). The second possibility is that the y” fields 
are anti-periodic under a; —> 0, + T: 


Neveu-Schwarz: w'(o,+7,02) = — "(o1,02) . (6.3.9) 


Such periodic boundary conditions are typically called “Neveu-Schwarz” 
(NS) boundary conditions, after A. Neveu and J. Schwarz (who introduced 
these fermionic boundary conditions in 1971). As we shall see in Lecture 
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#4, both of these boundary conditions are ultimately required for the self- 
consistency of the superstring. 

In the case of periodic (Ramond) boundary conditions, the mode- 
expansion of the ~ field resembles that of the X” field: 


Ramond: wi (o1 + 02) = a bP e~2in(or to2) 
neZ 
PRla — 02) = X bH etneo) | (6.3.10) 
neZ 


Here b!, bH are the (fermionic) creation and annihilation operators, satisfy- 
ing the anti-commutation relations 


where we recall the hermiticity condition b“,, = (b#)'. The same relations 
hold for the right-moving modes as well. This hermiticity condition follows 
from the fact that the ~ fields are Majorana (i.e., real) fields. Note that 
unlike the bosonic mode-expansion (6.2.9), we have joined the zero-modes 
together with the excited modes in (6.3.10).' There is also no “center-of- 
mass” term in the mode-expansion (a fermionic analogue of z”) because the 
w fields are Grassmann variables and thus lack a classical limit. Finally, 
also note that unlike the bosonic a# modes, which are rescaled relative 
to the usual harmonic oscillator modes by powers of the mode frequency 
n, the fermionic 6% modes are defined without this rescaling and hence 
satisfy the usual harmonic-oscillator commutation relations (6.3.11) di- 
rectly. This too is traditional in string theory. 

In the case of anti-periodic (Neveu-Schwarz) boundary conditions, the 
mode-expansion of the ~ field involves half-integer rather than integer 
modes: 


Neveu-Schwarz: (01 +02) = 5 bt e72ir(o1+02) 
rEzZ+1/2 
phla -o= XO betro). (6.3.12) 
r€Z4+1/2 


tWe are cheating slightly here, since the treatment of Ramond zero-modes for Majorana 
worldsheet fermions is actually quite subtle. In some sense, each Majorana fermion has 
only “half” a zero-mode. We will provide a rigorous discussion of this fact in Lecture #5. 
In the meantime, it will suffice to ignore this subtlety. 
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Once again, bt, bH are the (fermionic) creation and annihilation operators, 
satisfying the anti-commutation relations 


{be bgt = 1” Opts (6.3.13) 


where we have the hermiticity condition b“,. = (b#)!. 

The expressions for the total energy of a given string configuration 
now receive contributions from not only the bosonic oscillator modes, as 
in (6.2.12), but also the fermionic oscillator modes. These new contribu- 
tions are given by 


R: Ler) = X, n bË „bn 
n=0 
NS: RSS Eius (6.3.14) 
r=1/2 


with similar expressions for the right-movers. 

Finally, we must consider the vacuum energies az and ap for the su- 
perstring. Recall that for the bosonic string, each of the 24 transverse X 
fields contributed ax = —1/24, yielding a total of ag = ar = —1. This 
contribution from each bosonic field remains the same for the superstring, 
so we continue to have ax = —1/24. It therefore only remains to determine 
the vacuum-energy contributions from the worldsheet Majorana fermions, 
and it is found that 


Great Leap #6: Each Ramond fermion contributes 
vacuum energy ay = +1/24, whereas each Neveu-Schwarz 
fermion contributes vacuum energy ay = —1/48. 


We thus see that like the bosons, the Neveu-Schwarz fermions contribute 
negative vacuum energies, while Ramond fermions contribute positive vac- 
uum energies. 

Given these mode-expansions and commutation relations, it is instruc- 
tive to consider the Fock space of an individual Ramond (R) or Neveu- 
Schwarz (NS) fermion. It turns out to be simplest to consider the Fock 
space of an individual (left- or right-moving) NS fermion first. The two 
lowest-lying states are 


vacuum: |0) Le =0 


first-excited state:  b_42|0) L@) 1/2. (6.3.15) 
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Note that relative to the vacuum, all further excited states are reached 
through only half-integer excitations. Also note that the vacuum of the NS 
Fock space is unique, just like that of the bosons X”. What this means is 
that from the spacetime perspective, the vacuum is spinless (and hence a 
spacetime bosonic state), and that all subsequent excitations of the vacuum 
are also spacetime bosons. Recall, in this connection, that the fermion mode 
operators b are only fermionic from the worldsheet perspective; they are still 
bosonic operators (just like the fields Y” themselves) relative to spacetime 
Lorentz symmetries. 

Let us now consider the corresponding Fock space for the Ramond 
fermions with periodic boundary conditions. Once again, we have a tower 
of states 


vacuum: |0) LEO =0 
first-excited state: b_1|0) Le) = 1 (6.3.16) 


which now continues upwards through integer, rather than half-integer, 
steps. However, in this case it is important to observe that we also have a 
zero-mode in the theory. The existence of this zero-mode means that it is 
possible to excite this zero-mode without increasing the overall energy of 
the state. We therefore have the additional tower of states 


vacuum: bi |0) LNSS =0 
first-excited state: b_1b},|0) Le =1. (6.3.17) 


(Note that bo and bi are equivalant.) In other words, combining (6.3.16) 
and (6.3.17), we see that the Ramond vacuum consists of two degenerate 
states, 


l0) and  bĵlo), (6.3.18) 


and that all further excitations maintain this two-fold degeneracy. 

How can we interpret this two-fold degeneracy of the Ramond vacuum? 
It may seem, at first, that both of the states in (6.3.18) cannot be considered 
as the true vacuum, because the second state in (6.3.18) appears to be 
realized as a zero-mode excitation of the first. However, let us define the 
first state in (6.3.18) as |Vo) and let us also define |V1}) = v2b{]0}, which is 
a rescaling of the second state in (6.3.18). Then using (6.3.11), it is easy to 
show that 


IVa) = V205|Vo), Yo) = V201M4) . (6.3.19) 


300 Keith R. Dienes 


Thus, we see that neither state in (6.3.18) is more fundamental than the 
other, and there exists an unbroken symmetry between them — they are 
realized as zero-mode excitations of each other. The interpretation of this 
fact is that the true Ramond vacuum state is a two-component object, 
a spacetime spinor! It then follows that all of the excited states in the 
Ramond spectrum are also spacetime spinors, since they are realized as 
non-zero-mode excitations of a spinorial ground state. 

Of course, the above discussion is only suggestive, since we have not 
proven that these two vacuum states actually form a Lorentz spinor rep- 
resentation with respect to the spacetime Lorentz algebra. However, it is 
easy to see that this is indeed the case. Observe from (6.3.11) that the zero- 
modes satisfy the algebra {b}, b4} = n"”. Thus, if we define T” = v2ibh, 
then we see that {[“,[’} = —2n"”, which is nothing but the spacetime 
Clifford algebra. In other words, the zero-modes act as spinorial gamma- 
matrices. This implies that all states built upon such a vacuum state will 
transform in spinor representations of the spacetime Lorentz symmetry 
group SO(D — 1,1), and hence will be spacetime fermions. 

This is a remarkable result. Even though we have introduced worldsheet 
w fields which are spacetime bosons and which carry a spacetime Lorentz 
vector index, the algebra of zero-modes in the case of Ramond boundary 
conditions has managed to change these vector indices into spinor indices 
and thereby produce spacetime fermions. Of course, this is completely 
analogous to what happens in the usual four-dimensional Dirac equation, 
where the y” matrices are matrices in a spinor space but nevertheless carry 
vector indices. Thus, we see that by choosing Ramond boundary conditions 
for worldsheet fermions, string theory affords us with the same possibility. 
We therefore now see that string theory can indeed give rise to spacetime 
fermions: while excitations of worldsheet Neveu-Schwarz fermions give rise 
to spacetime bosons, excitations of worldsheet Ramond fermions give rise 
to spacetime fermions. 


6.4. Lecture #4: Some famous superstrings 


The next step is to determine the spectrum of the full superstring, just as 
we did for the bosonic string. However, the presence of two possibilities 
(Neveu-Schwarz and Ramond) for the modings of the fermions introduces 
several new complications relative to the bosonic string, and enables us to 
make different choices for what kind of superstring we wish to construct. 
These different choices are typically called different “string models”, and so 
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we are finally in a position to begin to discuss string model-building. That 
is the subject of the present lecture. 


6.4.1. String sectors 


Recall from the previous lecture that in light-cone gauge, the worldsheet 
field content of the ten-dimensional superstring consists of eight right- 
moving bosons Xp, eight right-moving Majorana-Weyl (one-component) 
fermions wp, and a similar set of left-moving fields Xz and Yz. The bosons 
Xz and Xp must have periodic (integer) modings because of their inter- 
pretation as spacetime coordinates, but their worldsheet fermionic super- 
partners Yz and qr can have either Ramond (periodic, integer) or Neveu- 
Schwarz (anti-periodic, halfinteger) modings. The question then imme- 
diately arises: What rules govern the possible self-consistent choices of 
fermion modings? A priori, the appearance of 16 distinct fermions would 
seem to lead to 218 different choices. 

It is easy to see that not all possibilities are allowed, however. One quick 
way to see this is to realize that if some of the right-moving fermions had 
different periodicities than other right-moving fermions, then these different 
periodicities would necessarily break spacetime Lorentz invariance because 
these fermions carry a spacetime vector index u. A similar situation would 
also hold for the left-moving fermions. This would then imply that all of 
the right-moving fermions should have the same periodicity as each other, 
and that all of the left-moving fermions should have the same periodicity 
as each other (though not necessarily the same as that of the right-moving 
fermions). However, this argument is not really satisfactory because we 
do not necessarily wish to preserve the full ten-dimensional Lorentz in- 
variance (or even its eight-dimensional transverse subgroup); after all, our 
sole phenomenological requirement is that four-dimensional Lorentz invari- 
ance must be maintained. Moreover, it goes against the spirit of string 
theory (as we have been presenting it) that we should demand a certain 
phenomenological property of the resulting spacetime physics when formu- 
lating our worldsheet theory. In string theory the spacetime physics is a 
consequence of the worldsheet physics, and we would ultimately like to base 
our worldsheet choices directly on worldsheet symmetries. 

Fortunately, it is easy to find a worldsheet argument that leads to the 
same constraint. Recall that the worldsheet symmetry that we must main- 
tain is superconformal invariance. The worldsheet supersymmetry that 
makes up superconformal invariance is generated by the two worldsheet 
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supercurrents given in (6.3.6). Because these two supercurrents are also 
worldsheet fermionic, they may also be either periodic or anti-periodic as 
we traverse the closed string. Indeed, each individual term y“OX,, in these 
supercurrents will have the periodicity property of the fermion Y#. How- 
ever, in order for each of these supercurrents Jr and Jz to have a unique, 
well-defined periodicity as we traverse the closed string, we see that it is 
necessary that all right-moving fermions have the same periodicity as each 
other, and that all left-moving fermions have the same periodicity as each 
other. This is required in order to preserve worldsheet supersymmetry. 
Thus, we have our first constraints on fermion modings: 


e All right-moving fermions Y% must have the same periodicity as 
each other, either Ramond or Neveu-Schwarz. 

e All left-moving fermions Y? must have the same periodicity as each 
other, either Ramond or Neveu-Schwarz. 


Note that there is no requirement that the right- and left-moving periodic- 
ities be the same. 


Table 6.1. The four possible sectors of 
the ten-dimensional superstring, numbered 1 
through 4. Here ‘NS’ and ‘R’ respectively 
indicate Neveu-Schwarz (anti-periodic) and 
Ramond (periodic) boundary conditions for 
worldsheet fermions, and ag and az respec- 
tively denote the corresponding right- and 
left-moving vacuum energies. 


Dee Ee Lee [oe | 

NS AS =1/2 a] 
T are E 

NS ~1/2 

Given these constraints, we see that we are left with four distinct pe- 
riodicity choices for our sixteen Majorana-Weyl worldsheet fermions, as 
shown in Table 6.1. Each individual choice is called a sector or spin struc- 
ture of the superstring, so we see that the ten-dimensional superstring has 
four possible sectors. For future convenience, these sectors have been num- 
bered in Table 6.1. We have also indicated the corresponding right- and 
left-moving vacuum energies of these sectors. Recall from the previous lec- 


ture (in particular, Great Leap #6) that the vacuum-energy contribution of 
each Ramond fermion is +1/24, while that of each Neveu-Schwarz fermion 


2 
3 
4 
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is —1/48 and that of each worldsheet boson is —1/24. Therefore, generally 

assuming nyns Neveu-Schwarz fermions and np Ramond fermions, we can 

add these individual contributions to find 
1 


a= — zg (8— mr + gms) at SS aa (6.4.1) 


The second equality results from setting np = 8 — nyns. Of course, as 
discussed above, in the ten-dimensional superstring we are restricted to the 
cases nys = 0,8 for both the right- and left-moving fermions. 


6.4.2. Modular invariance and GSO projections 


The next question that arises is whether we are free to pick any one of these 
sectors to construct our superstring theory, or whether we must consider 
all of them together, superposing the spectrum from each sector separately 
in order to construct the full superstring spectrum. What rules govern the 
choices of sectors? 

Ultimately, it turns out that a special form of conformal invariance 
known as modular invariance will give us the answer. In keeping with the 
spirit of these lectures, we will not be able to provide a proper mathematical 
discussion of modular invariance. (Indeed, doing so would require a pre- 
liminary discussion of string partition functions and the modular group.) 
However, we can discuss the relevance and implications of modular invari- 
ance at a conceptual level. 

Recall from Lecture #2 that our string actions always have a certain 
symmetry known as conformal invariance, which reflects the fact that the 
action should be invariant under local reparametrizations and rescalings 
of the coordinates (01,02) that parametrize the string worldsheet. For 
tree-level string interactions, demanding this local symmetry is sufficient to 
ensure that the resulting physics is indeed invariant under arbitrary coor- 
dinate reparametrizations. This is because any tree-level string interaction 
has the topology of a sphere (a genus-zero surface, with no handles), and on 
a sphere it can be shown that any possible net coordinate reparametrization 
can be generated or “built up” in small steps as the cumulative effect of 
small, local coordinate reparametrizations. Geometrically, this is equivalent 
to saying that any closed loop on the surface of a sphere can be continuously 
shrunk to a point, as illustrated in Fig. 6.7(a), by sliding the loop along the 
surface of the sphere towards one side. Thus, demanding invariance under 
local coordinate reparametrizations (i.e., conformal invariance) by itself is 
sufficient to guarantee consistency for tree-level string amplitudes. 
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(a) (b) 


\ 
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\ 


no non-contractible two non-contractible 
loops on sphere loops on torus 


Fig. 6.7. (a) On a sphere, all closed loops can be continuously shrunk to a point. 
(b) On a torus, there exist two topologically distinct non-contractible loops. 


However, this situation changes drastically if we now consider one-loop 
amplitudes. As discussed in Lecture #1, these amplitudes have the world- 
sheet topology of a torus (a genus-one surface), and we see from Fig. 6.7(b) 
that on a torus there exist two types of closed loops that cannot be contin- 
uously shrunk to a point. Such loops are said to be non-contractible, which 
is indeed the defining property of such higher-genus surfaces. The presence 
of these non-contractible loops means that for torus diagrams, there exist 
possible coordinate reparametrizations that cannot be built up from local 
coordinate reparametrizations alone. Indeed, these reparametrizations non- 
trivially involve “large”, discrete mappings around these non-contractible 
loops. Thus, we see that demanding conformal invariance alone is not suf- 
ficient to ensure that one-loop string amplitudes are truly invariant under 
worldsheet coordinate reparametrizations: we must also demand an invari- 
ance under these “large” discrete mappings around these non-contractible 
loops. This additional global invariance is called “modular invariance” , and 
just like conformal invariance, it too stems from our need to maintain the 
overall invariance of the string under reparametrizations and rescalings of 
the worldsheet coordinates. 

One might wonder, at this stage, why we are suddenly worrying about 
modular invariance, whereas we did not need to consider modular invari- 
ance in Lecture #2 when we discussed the bosonic string. The truth of the 
matter is that we must always consider modular invariance in addition to 
conformal invariance, regardless of the type of (closed) string we are dis- 
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cussing. However, in the simple case of the 26-dimensional bosonic string, 
it turns out that all amplitudes are trivially modular-invariant, so we did 
not need to make recourse to modular invariance in order to distinguish 
between different possibilities. However, for the superstring (and partic- 
ularly for the heterotic string to be discussed later), the possible sector 
choices become quite numerous, and it turns out that modular invariance 
is the powerful tool by which we are able to narrow down the self-consistent 
possibilities. 

What, then, are the effects of modular invariance? It turns out that 
at the level of string model-building, modular invariance has two primary 
effects: 


e it forces us to consider only certain selected sets or combinations 
of underlying sectors, and 

e it produces new constraints (beyond the level-matching constraint 
Lo = Lo) that govern which Fock-space excitations are allowed in 
each sector. 


These new constraints are called GSO constraints, after F. Gliozzi, 
J. Scherk, and D. Olive who first imposed some of these constraints in 1977. 
The important point is that these conditions stem directly from modular 
invariance, and thus they follow from the worldsheet physics of the string 
and do not represent any additional arbitrary input. We will provide many 
explicit examples of such combinations and constraints shortly. 

In order to construct a fully consistent string model, therefore, our pro- 
cedure is as follows. First, we must determine which are the allowed sectors 
that need to be considered as part of our set. For each of these sectors in 
our allowed set, we then determine the corresponding Fock space of phys- 
ical states by applying not only the usual level-matching constraint, but 
also the GSO constraints appropriate for that sector. In this way each un- 
derlying sector then gives rise to a different Fock space of states, and the 
full Hilbert space of states for the full string theory (i.e., for the resulting 
string “model”) is nothing but the direct sum of these different Fock spaces 
corresponding to each of the underlying sectors in the specified set. This 
then yields a fully self-consistent (and in particular, modular-invariant) 
theory. 

This is an important point, so it is worth repeating: the full Hilbert 
space of string states is given by the direct sum of the different Fock 
spaces corresponding to different underlying boundary conditions for world- 
sheet fields. In order to better understand this fact, an analogy with QCD 
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may be useful. Recall that Yang-Mills quantum field theory contains non- 
perturbative instanton solutions, and therefore one can imagine doing quan- 
tum field theory in an n-instanton background |n). Of course, as we know, 
the full vacuum state of QCD is not composed of any one of these |n) 
vacua by itself, but rather by an appropriately weighted combination of 
these vacua: 


Ja) = Sve jn). (6.4.2) 


This is the famous 6-vacuum of QCD. The situation that we now face in 
string theory is somewhat analogous. The fact that the string worldsheet 
fermions can have different boundary conditions (thereby giving rise to dif- 
ferent sectors) is in some sense analogous to the fact that QCD can have 
different instanton backgrounds. Indeed, each underlying string sector is 
analogous to a different n-instanton vacuum state |n), and the different 
“combinations of sectors” that we are now being forced to consider are 
analogous to the different QCD 6-vacua. In this sense, then, each different 
“string model” that we will be constructing can be viewed as a different 
-vacuum of string theory! Of course, this analogy with the QCD 6-vacuum 
can take us only so far. One important difference is that whereas the 6- 
vacuum necessarily involves all of the |n) states regardless of the value of 
0, in string theory our “vacuum” may consist of more complicated combi- 
nations of sectors which may or may not include all possible sectors. In 
fact, the more sectors that are included in our “combination of sectors”, 
the more GSO constraints there are for each sector. But the important 
lesson that emerges from all of this is that no single sector by itself forms a 
consistent string vacuum; rather, we must select an appropriate combina- 
tion of sectors and add together their corresponding Fock spaces in order 
to produce the fully self-consistent string model. 

In Lecture #7, we shall provide an explicit set of rules which will en- 
able us to quickly determine the appropriate sector combinations and GSO 
constraints that can be chosen in order to yield self-consistent theories. 
For the time being, however, we shall defer a discussion of these rules and 
proceed directly with the construction of actual string models in order to 
deduce their physical properties. Therefore, even though we shall simply 
assert certain sector combinations and GSO projections to be required by 
modular invariance, we stress that all of these features can (and ultimately 
will) be derived using the rules to be presented in Lecture #7. 


String Theory, String Model-Building, and String Phenomenology 307 


6.4.3. Ten-dimensional superstring models 


In the case of the ten-dimensional superstring, we have already seen that 
the four possible sectors are listed in Table 6.1. It then only remains to 
determine the particular sector combinations and GSO constraints that are 
required by modular invariance. In this case, it turns out that there are 
only two possible combinations or sets of sectors that can be considered: 


e we consider the contributions from only Sectors #1 and #2, or 
e we consider the contributions from all Sectors #1 through #4. 


Moreover, for each of the above cases, it turns out that there are two 
possible choices of GSO projections that may be imposed in each sector. 
Thus, combining all of these possibilities, we see that there are four distinct 
possible superstring “models” that can be constructed in ten dimensions. 
We shall therefore now turn to a construction of these models. 


6.4.3.1. The Type 0 strings 


Let us begin by considering the first option, taking our set of sectors to 
consist only of Sectors #1 and #2. For each of these sectors, we need to 
determine the appropriate GSO constraints that must be applied in addi- 
tion to the usual level-matching constraint. In order to write down these 
GSO constraints, let us first recall that for a given left-moving worldsheet 
fermion (with either Ramond or Neveu-Schwarz boundary conditions), the 
corresponding number operator is defined by 


R: N® = S 
n=0 


NS: NÖ = X` bn. (6.4.3) 
r=1/2 
Here the index 7 = 1,...,8 labels the individual fermion. For right-moving 
fermions, the analogous number operators ÑC) are constructed using the 
right-moving mode operators bn, b,. Let us also define Nz and Nr respec- 
tively as the total left- and right-moving number operators, i.e., 


8 8 
N = X N°, Ne = X ÑO. (6.4.4) 
yal {=f 


Note that these number operators are defined to include only the contri- 
butions of the worldsheet fermions, and in particular do not include the 
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contributions of the worldsheet bosons. It then turns out (and we shall 
see in Lecture #7) that if we choose our set of sectors to consist only of 
Sectors #1 and #2, then the appropriate GSO constraints in each sector 
are as follows: 


Sector #1: Nr — Nr = even 
Sector #2: NL- Ne = { oad \ (6.4.5) 
even 


In the second line, we have used a brace notation to indicate a further 
choice: we can choose to impose either the ‘odd’ constraint, or the ‘even’ 
constraint. As we shall see, this is a residual choice that is not fixed by 
modular invariance (or by any other worldsheet symmetry), leading to two 
equally valid possibilities. Thus, we see that if we choose our set of sectors 
to consist of only Sectors #1 and #2, then this leads to two different string 
models depending on our subsequent choice of which GSO constraint we 
choose to impose in (6.4.5). 

Let us now determine the spectra of these two models, beginning with 
the states that arise from Sector #1. Note that in this sector, both models 
have the same states (because both models have the same GSO constraint 
for Sector #1). As with the bosonic string, our procedure is to consider 
all possible excitations of the worldsheet fields (in this case, the worldsheet 
fermions as well as the worldsheet bosons). These excitations are subject 
to the level-matching constraint Lo = Lo (which ensures that the total 
bosonic and fermionic worldsheet energy is distributed equally between 
left- and right-moving excitations) and the GSO constraint Nz — Np = even 
(which is a constraint on the worldsheet number operators of the worldsheet 
fermions only). In general, the mass-shell condition for the superstring is 


a'M? = 2(Lo+Lo+ar+ar) (6.4.6) 


where az and ap are the individual left- and right-moving vacuum energies, 
and where Lo and Lo include the contributions from not only the worldsheet 
bosons, but also the worldsheet fermions. Note from Table 6.1 that the left- 
and right-moving vacuum energies in Sector #1 are ay = ar = —1/2. 

We see that the tachyonic vacuum state |0) p Q |0); satisfies both con- 
straints, and thus it remains in the spectrum. However, unlike the tachyon 
in the bosonic string (which has spacetime mass a’ M? = —4), we see 
from (6.4.6) that the tachyonic state in the superstring has spacetime mass 
a’ M? = —2. This is the result of the smaller (less negative) vacuum energy 
of the superstring compared to that of the bosonic string. 
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Because the vacuum energies in Sector #1 are a, = ar = —1/2, we see 
that massless states cannot be obtained by exciting the quantum modes of 
worldsheet bosons, for each of these excitations would add a full unit of 
energy. Instead, massless states can be obtained only by adding a half-unit 
of energy. Fortunately, this is possible in Sector #1 because in this sec- 
tor, all worldsheet fermions have Neveu-Schwarz boundary conditions and 
therefore have half-integer modings. The first excited states in Sector #1 
are therefore 


bY 1210) p @ b 4/210), - (6.4.7) 


Note that these states satisfy both the level-matching constraint (since 
Lo = Lo = 1/2) as well as the GSO constraint (since Ng = Nr = 1). 
The interpretation of these states is precisely the same as in the bosonic 
string: these states give us the gravity multiplet, consisting of the graviton 
guv, dilaton ¢, and anti-symmetric tensor B,,. Mathematically, this is 
equivalent to the tensor-product rule for Lorentz transverse SO(8) vector 
representations: 


Vg @Vg = 1 © 28 © 35 (6.4.8) 


where Vg is the eight-dimensional vector representation, and where the 
1 representation is the spin-zero state, the 28 representation is the spin- 
one state, and the 35 representation is the spin-two state. It is indeed a 
general principle that all weakly coupled closed strings contain at least these 
massless states, and this is a useful cross-check of the GSO constraints. 

Let us now turn to the states from Sector #2. Before concerning our- 
selves with the implication of the GSO constraints in (6.4.5), let us first 
understand the general structure of the states from this sector. In this sec- 
tor, the vacuum energy (according to Table 6.1) is (az,az) = (0,0), so we 
see immediately that this sector contains no tachyons. Indeed, the ground 
state is already massless, so all that will concern us here is the nature of 
this ground state. As we discussed at the end of Lecture #3, the left- and 
right-moving ground states in this sector are each spacetime spinors since 
all worldsheet fermions in this sector have Ramond boundary conditions. 
Because the nature of these spinors will be important to us, let us pause to 
review some properties of these spinors. 

Since we are considering these ten-dimensional strings in light-cone 
gauge, the Lorentz group that concerns us here is the transverse (“little”) 
Lorentz group SO(8). In general, the groups SO(2n) share a number of 
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properties. Their smallest representations, of course, are simply the iden- 
tity representations. These are singlets, which will be denoted 1. The next 
representations are the vector representations, which are (2n)-dimensional, 
and which will be denoted V2,. Along with these are the spinor repre- 
sentations, which are (2”~')-dimensional. In general, there are two types 
of spinor representations, S and C, the so-called “spinor” and “conjugate 
spinor” representations. In the special case of SO(8), the vector, spinor, 
and conjugate spinor representations are all eight-dimensional, and will be 
denoted Vg, Sg, and Cg respectively. The distinction between Sg and Cg 
is one of spacetime chirality, but the choice of which is to be associated 
with a given physical chirality is a matter of convention. 
The ground state of Sector #2 has the structure 


{05} 10) 2 © {05} 10), (6.4.9) 


where the notation {b} } (and similarly for the right-movers) indicates that 
each of the individual Ramond zero-modes can be either excited or not 
excited. 

How can we interpret (6.4.9) physically? This issue is actually quite sub- 
tle, and we shall not have the space to give a proper discussion. Moreover, 
as we have already indicated, we are not giving a fully rigorous treatment 
of Ramond zero-modes in these lectures, since our aim is to focus more on 
the physics than the formalism. However, it is possible to understand the 
appropriate physical interpretation intuitively. First, let us count the num- 
ber of states in (6.4.9). A priori, it would seem that we have 216 individual 
states, since each Ramond fermion zero-mode can either be excited or not 
excited. However, this is not correct because (as we shall discuss more 
completely in Lecture #5, and as we have already hinted in the footnote in 
Sec. 3.2), one should really count only one zero-mode per pair of Ramond 
Majorana-Weyl fermions. Thus, we can imagine that there are only four 
independent zero-modes for the right-movers, and four for the left-movers. 
Therefore, (6.4.9) consists of only 2° = 128 states. 

All combinations of these zero-mode excitations already satisfy the level- 
matching constraint (since Lo = Lo = 0). Imposing either of the GSO 
constraints for Sector #2 in (6.4.5) then reduces the number of allowed 
states by a factor of two. Specifically, if we impose the constraint Nz — 
Nr = odd, then we can choose only an even number of right-moving zero- 
mode excitations together with an odd number of left-moving zero-mode 
excitations, or an odd number of right-moving excitations together with an 
even number of left-moving excitations. Choosing the constraint Ng —NpR = 
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even has the opposite effect, pairing even numbers of excitations for left- 
and right-movers with each other, and likewise pairing odd numbers with 
each other. 

Interpreting these results is therefore quite simple. As we discussed 
at the end of Lecture #3, the left-moving states and right-moving states 
are spacetime spinors, and we have already seen that there are two possible 
spinors, Sg and Cg. At this stage, the names assigned to each are arbitrary, 
so we shall now establish the following convention: spinors realized by an 
even number of zero-mode excitations will be identified with Cg, and those 
realized by an odd number of zero-mode excitations will be identified with 
Ss. Of course, only the relative difference between these two spinors is 
physically significant (having the interpretation of spacetime chirality). 

Given these definitions, we see that if we choose the first GSO constraint 
Nz — Nr = odd, the 128 states in (6.4.9) decompose into 


(Cg @Sg) © (Sg @Cz) , (6.4.10) 


whereas if we choose the second GSO constraint Nz; — Nr = even, these 
states instead decompose into 


(Cs © Cs) © (Sg @Sz) . (6.4.11) 


If we wish to further decompose these states into representations of the 
Lorentz group, we can use the SO(8) tensor-product relations 


Ss Q Ss = 16 28 $ 35' 
Cs ® Cs = 16 28 6 35” 
Ss Q Cs = Va $ 56 . (6.4.12) 


Here the 28 representation is the anti-symmetric component of the spinor 
tensor product (spin-one), while the 35’ and 35” representations are the 
symmetric components of the spinor tensor product (also spin-one). (These 
latter representations are not to be confused with the spin-two 35 graviton 
representation in (6.4.8).) Likewise, the 56 is a certain vectorial (spin-one) 
higher-dimensional representation.* However, for our present purposes it 
*For the mathematically inclined reader, we can succinctly describe all of these states as 
follows. Recall that a given representation is called a p-form if it can be realized as the 
totally anti-symmetric combination within the tensor product of p different vector indices 
of SO(8), with resulting dimension 8 x 7x 6x... x (9—p)/p!. Using this language, we see 
that singlet states are zero-forms, the 28 representations are two-forms, and the 35’ and 
35” representations are “self-dual” four-forms. (The self-duality condition eliminates 


exactly half of the degrees of freedom in the four-form.) Likewise, the Vg state is a 
one-form, and the 56 representation is a three-form. These different forms (and the 
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will be sufficient to think of these states in the tensor-product forms (6.4.10) 
and (6.4.11). Note that in each case, the tensor product of two spacetime 
fermionic (spinor) states produces a spacetime bosonic state. Thus, just as 
in Sector #1, the states emerging in Sector #2 are spacetime bosons. 

Thus, summarizing, we see that the spectra of our two resulting super- 
string models are as follows. First, from Sector #1, we have the tachyonic 
state |0) p ® |0),. In the notation of SO(8) Lorentz representations, this 
state may be denoted 1 ® 1; this tachyon is a Lorentz singlet. Next, we 
have the massless gravity multiplet. In the notation of SO(8) Lorentz rep- 
resentations, this state takes the form Vg ® Vg. Finally, from Sector #2, 
we have massless states whose form depends on the particular choice of the 
GSO projection. In the first case, we have the states given in (6.4.10), while 
in the second case, we have the states given in (6.4.11). There are then, as 
usual, an infinite tower of massive (Planck-scale) states above these. 

The string model produced by the first GSO projection is called the 
Type OA string model, and the second is called the Type 0B string model. 
Collectively, these are sometimes simply called the Type 0 strings. As 
we see, both of these strings are tachyonic, and moreover they contain 
only bosonic states. Furthermore, as is evident from (6.4.10) and (6.4.11), 
both of these strings are non-chiral. In other words, they are invariant 
under the transposition Sg +> Cg for the left- and right-movers. These 
string theories were first constructed by N. Seiberg and E. Witten in 1985. 
Although not relevant for phenomenology, they are currently proving to 
have an important role in understanding certain non-perturbative aspects 
of non-supersymmetric string theory. 


6.4.3.2. The Type IT strings 


Let us now turn to the second choice outlined at the beginning of Sec. 4.3, 
namely the case in which we consider the contributions from all of the 
sectors in Table 6.1. This will result in the so-called Type IT strings. As 
we discussed at the end of Sec. 4.2, it is a general property that the larger 
the set of sectors that we consider, the more GSO constraints there are 
that must be imposed in each sector. Thus, the introduction of new sectors 
generally leads to new GSO constraints in each of the sectors (old and 
new), and likewise the introduction of new GSO constraints in a given 
sector requires the introduction of entire new sectors to compensate. 


so-called D-branes whose existence they imply) are important when considering the 
non-perturbative structure of these string theories. 
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It turns out (and we shall see explicitly in Lecture #7) that if we consider 
the full set of sectors in Table 6.1, then the appropriate GSO constraints 
in each sector are given as follows: 


Sector #1: Nr —Nr=odd, Npr = odd 
odd 


even 


Sector #2: N,-Ne={ \ , Nr=odd 


Sector #3: Nr —Nr=even, Nr = odd 
odd 


even 


Sector #4: NL = Nr = { \ ; NR =odd. (6.4.13) 


Note that in each case where a choice is possible, these choices are corre- 
lated: we simultaneously choose either the top lines within all braces, or 
the bottom lines. Thus, once again there are two sets of GSO conditions 
that can be imposed, resulting in two distinct string models. 

Before proceeding further, it is useful to note the pattern of these GSO 
projections. In the case of the Type 0 strings, we considered only Sec- 
tors #1 and #2; as shown in Table 6.1, these were the sectors for which 
the right-moving fermions were always identical to the left-moving fermions 
and shared the same boundary conditions. The corresponding GSO projec- 
tions in (6.4.5) likewise did not distinguish between right- and left-moving 
fermions. (In this context, note that the GSO projections in (6.4.5) can 
equivalently be written with minus signs replaced by plus signs.) Thus, in 
some sense, the Type 0 strings are symmetric under exchange of left- and 
right-movers. However, for the Type II strings, we have now introduced two 
additional sectors (Sectors #3 and #4) whose structure explicitly breaks 
this symmetry between left- and right-movers. No longer does each sec- 
tor individually exhibit this left/right symmetry. As we see from (6.4.13), 
the effect of this breaking is to introduce additional GSO conditions which 
mirror this broken symmetry by becoming sensitive to right- or left-moving 
number operators by themselves. The technical word for this breaking of 
symmetry is “twisting” or “orbifolding”, for by including Sectors #3 and 
#4, we see that we have twisted the left-movers relative to the right-movers 
by allowing them to have oppositely moded boundary conditions. Thus, the 
Type II strings that will result can be viewed as twisted (or orbifolded) ver- 
sions of the Type 0 strings. This twisting procedure ultimately serves as 
the means by which more and more complicated (and more and more phe- 
nomenologically realistic) string models may be constructed, and will be 
discussed more fully in Lecture #7. 
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Given the GSO constraints in (6.4.13), we can proceed to determine the 
resulting spectrum just as we did for the Type 0 strings. Let us begin with 
Sector #1 (this is often called the “NS-NS sector”). Because the boundary 
conditions of the worldsheet fermions are the same in this sector as they 
were for the Type 0 strings, the possible states that arise are the same as 
they were for the Type 0 strings, and consist of the tachyon |0) p & |0) ; as 
well as the gravity multiplet (6.4.7). The only difference is that we must now 
impose the additional GSO constraint Nr = odd. It is immediately clear 
that the effect of this new GSO constraint is that the tachyon is projected out 
of the spectrum, while the gravity multiplet is retained. Thus, by “twisting” 
the Type 0 strings in just this way, we have succeeded in curing one of the 
major problems of the bosonic and Type 0 strings, namely the appearance 
of tachyons. Moreover, we have done this without eliminating the desirable 
gravity multiplet. 

Let us now consider the states from Sector #2 (this is often called the 
“Ramond-Ramond” sector). Once again, if we impose only the first GSO 
constraint in (6.4.13), we obtain the states in either (6.4.10) or (6.4.11). Im- 
posing the additional GSO constraint in (6.4.13) then enables us to project 
out half of these states, so that we retain only the states 


Ss @ En . (6.4.14) 


These states are spacetime bosons. 

Finally, let us consider the states that arise in the new Sectors #3 and 
#4. In Sector #4, the vacuum energy is (ar,a) = (—1/2,0). Therefore, 
in order to have level-matching (Lo = Lo), we see that we are immediately 
forced to excite a half-unit of energy for the right-movers while not increas- 
ing the energy of the left-movers. This is the only way to produce a massless 
state. This also ensures that this sector does not give rise to tachyons. For- 
tunately, since the right-moving fermions have Neveu-Schwarz boundary 
conditions in this sector, these fermions have half-integer modings, and 
thus by exciting their lowest modes we can indeed introduce a half-unit 
of energy. The left-moving fermions have Ramond boundary conditions in 
this sector, and hence their ground state is the Ramond zero-mode state. 
The massless states in Sector #4 therefore take the form 


bY 1210p & {b5} 10), - (6.4.15) 


At this stage, of course, these states satisfy only the level-matching con- 
straint. Imposing the GSO constraints then leaves us with the state in 
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which we excite only an even (or odd) number of left-moving Ramond zero 
modes. 

How can we interpret this state? First, we notice that this state is a 
spacetime fermion because it results from tensoring a right-moving Neveu- 
Schwarz state with a left-moving Ramond state. Thus, we now have a 
string theory that contains spacetime fermions! This is yet another benefit 
of performing the “twist” that takes us from the Type 0 strings to the 
Type II strings. However, let us examine this state a bit more closely. 
Clearly, it has the Lorentz structure 


Vs 9 ee, (6.4.16) 


where we have retained the spinor-labelling conventions that we employed 
for the Type 0 strings. The relevant tensor-product decompositions in this 
case are given by 


Vs ® Cg = Sg © 56’ 
V; 2 Ss = Cg @ 56” (6.4.17) 


where the Sg and Cs representations are spin-1/2 and where the 56’ and 
56” representations are spin-3/2. Thus, we see that the Type II strings 
contain a massless, spin-3/2 object! Just as a massless spin-two object 
satisfies the Einstein field equations and must be interpreted as the graviton, 
a massless spin-3/2 object must be interpreted as a gravitino — i.e., a 
superpartner of the graviton. This implies that this string not only gives rise 
to spacetime bosons and fermions, but actually gives rise a spectrum which 
exhibits spacetime supersymmetry! This is yet another phenomenologically 
compelling feature. 

Finally, let us now consider Sector #3. This sector has vacuum energies 
(ar, az) = (0,—1/2), so now we must excite right-moving zero-modes and 
left-moving b”; /2 modes. This then leads to states of the form 


{56} |0) p ® bP i2 |0) > ) (6.4.18) 


and imposing the GSO projections results in states with the Lorentz struc- 
ture Sg ® Vg. Once again, this also contains a gravitino! 

So what do we have in the end? The first choice of GSO projections re- 
sults in the so-called Type IIA string, while the second choice results in the 
Type IIB string. Both of these strings are tachyon-free, and their spectra 
contain both bosons and fermions. Moreover, these strings exhibit space- 
time supersymmetry. This is most easily seen in the following suggestive 
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way. Let us collect together the states from all four sectors, retaining our 
Lorentz-structure tensor-product notation: 


Vs 8 Vs, solih, vof h, Ss & Vs. (6.4.19) 
8 


Together, this collection of states can be written in the factorized form 


(Vs @ 53) ® (vs © { . \) (6.4.20) 


We thus see that there are two spacetime supersymmetries exhibited in this 
massless spectrum: the first exchanges Vg Sg amongst the right-movers, 
while the second exchanges 


Vgc a \ (6.4.21) 


amongst the left-movers. Thus, the massless spectrum exhibits N = 2 
supersymmetry. This is, of course, consistent with the appearance of two 
gravitinos in the massless spectrum (one from Sector #3 and one from 
Sector #4). Another way to understand this N = 2 supersymmetry is to 
realize that the first supersymmetry relates the bosonic states in Sector #1 
to the fermionic states in Sector #3 (and the bosons in Sector #2 to the 
fermions in Sector #4), while the second supersymmetry relates the bosons 
in Sector #1 to the fermions in Sector #4 (and the bosons in Sector #2 to 
the fermions in Sector #3). In either case, we thus see that we have two 
independent spacetime supersymmetries. 

It is important to note that we did not demand spacetime supersymme- 
try when constructing the superstring. We merely introduced worldsheet 
supersymmetry, and found that spacetime supersymmetry emerged nat- 
urally as the result of certain GSO projections. This further illustrates 
the fact that in string theory, spacetime properties such as supersymmetry 
emerge only as the consequences of deeper, more fundamental worldsheet 
symmetries. Another important point is that the same “twist” which elim- 
inated the tachyon has introduced spacetime supersymmetry. While this 
is certainly an interesting phenomenon that arises for ten-dimensional su- 
perstrings, it is certainly not a general property that the elimination of the 
tachyon requires spacetime supersymmetry. In particular, we shall see in 
Lecture #6 that it is possible to construct string theories whose tree-level 
spectra lack spacetime supersymmetry but nevertheless are tachyon-free. 
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One might question whether we have really demonstrated the existence 
of N = 2 supersymmetry, since we have examined only the massless spec- 
trum. However, it can be shown that any unitary theory which contains a 
massless spin-3/2 state necessarily exhibits supersymmetry, and hence must 
be supersymmetric at all mass levels (7.e., for all massive, excited states as 
well). Of course, this is still not a proof, since we do not a priori know (and 
would therefore need to verify) that string theory is a consistent theory in 
this sense. However, it is possible to construct (two) explicit spacetime su- 
percurrent operators and to demonstrate that they commute with the full 
(massless and massive) spectrum of the string. Another approach (as indi- 
cated in the footnote in Sec. 3.1) is to develop an alternative formulation 
of the superstring in which spacetime (rather than worldsheet) supersym- 
metry is manifest at the level of the string action, and to demonstrate the 
equivalence of the two formulations. Indeed, both approaches have been 
successfully carried out, thereby demonstrating that the Type II spectrum 
is indeed N = 2 supersymmetric. It is for this reason that these strings are 
referred to as Type II strings. 

One important distinction between these two strings is their chirality. 
The Type IIA string, as we see, contains two supersymmetries of oppo- 
site chiralities, interchanging Vg © Sg for the right-movers and Vg © Cg 
for the left-movers. Equivalently, the two gravitinos associated with these 
supersymmetries are of opposite chiralities (because the 56’ and 56” rep- 
resentations in (6.4.17) are of opposite chiralities). Because it contains su- 
persymmetries of both chiralities, this string is ultimately non-chiral, and 
its low-energy (field-theoretic) limit consists of so-called Type ITA super- 
gravity (whose discovery predates that of the Type IIA string). It is for 
this reason that this string is called the Type HA string. The Type IIB 
string, by contrast, contains two supersymmetries (or two gravitinos) of the 
same chirality, exchanging Vg © Sg and Vg © Sg respectively. Thus, this 
string theory is chiral, and has a low-energy field-theoretic limit consisting 
of Type IIB supergravity. 

We conclude, then, that by introducing a twist relative to the Type 0 
strings, we have constructed a set of strings (the Type IIA and Type IIB 
strings) that exhibit a number of compelling features: they are tachyon- 
free, they contain both bosons and fermions in their spacetime spectra, they 
contain gravity, and they are spacetime N = 2 supersymmetric. Despite 
this success, however, there is still something that we lack: we do not, as 
yet, have gauge symmetries. Specifically, there are no gauge bosons (such as 
photons, gluons, or W and Z particles). Likewise, there are no states which 
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carry gauge charges. Therefore, once again, we shall need to construct a 
new kind of string. 


6.5. Lecture #5: Neutrinos have gauge charges: The het- 
erotic string 


6.5.1. Motivation and alternative approaches 


Thus far in these lectures, we have shown how string theory can give rise 
to quantized gravity, spacetime bosons and fermions, spacetime supersym- 
metry, and tachyon-free spectra. There is, however, one important phe- 
nomenological feature that is still missing: gauge symmetry. In other words, 
we wish to have massless gauge bosons, i.e., spacetime vectors that trans- 
form in the adjoint representation of some internal symmetry group. As a 
side issue, we would also like to find a way of breaking N = 2 supersym- 
metry to N = 1 supersymmetry (if our goal is to reproduce the MSSM) or 
even to N = 0 supersymmetry (if our goal is to reproduce the Standard 
Model). 

It is worth considering why such gauge-boson states fail to appear for 
the ten-dimensional Type II strings discussed in the previous lecture. The 
problem is the following. In order to produce worldsheet bosons, we are 
restricted to considering only the NS-NS or Ramond-Ramond sectors (Sec- 
tors #1 and #2 in Table 6.1). In the NS-NS sector (Sector #1), the 
vacuum energy is (ar,az) = (—1/2,—1/2), so we must excite the half- 
energy fermionic mode oscillators b” i jx bË 1/2 for the both the left- and 
right-movers. This produces a state with two vector indices rather than 
one, and as we see from the vector-vector tensor-product decomposition in 
(6.4.8), this does not contain a vectorial state. In the Ramond-Ramond 
sector (Sector #2), by contrast, the vacuum energy is (ar,az) = (0,0), 
which implies that our massless states comprise the tensor product of two 
Ramond spinors as in (6.4.10) for the Type IIA string, or as in (6.4.11) 
for the Type IIB string. In the case of the Type IIB string, we see from 
(6.4.12) that the tensor product Sg @ Sg does not contain a vector state 
Vs. Thus, the Type IIB string contains no massless vectors. In the case of 
the Type IIA string, we observe from (6.4.17) that indeed Sg 9 Cg D Vs, 
and thus the Type IIA string does contain a massless vector. (This state 
is often called a “Ramond-Ramond gauge boson”.) However, the U(1) 
“gauge” symmetry associated with this state is too small to contain the 
Standard-Model gauge group, and moreover it can be shown that no states 
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in the perturbative spectrum of the Type IIA string spectrum can carry 
this Ramond-Ramond charge.* 

In each case, the fundamental obstruction that we face is that we need 
to generate representations of a gauge group (i.e., an internal symmetry 
group) that is different from the Lorentz group. Until now, all of our 
worldsheet fields (such as Xý g and {FP g) have carried Lorentz indices 
associated with the SO(D — 1,1) Lorentz symmetry. In order to produce 
a separate gauge symmetry, we therefore need fields which do not carry a 
Lorentz index but which carry a purely internal index. (Note that these 
fields cannot carry a Lorentz index because we ultimately want our gauge 
symmetries to commute with the Lorentz symmetries.) 

How can we do this? One idea is to compactify the Type II strings that 
we constructed in the previous lecture. Although this approach ultimately 
fails for phenomenological reasons, it will be instructive to briefly explain 
this idea. Recall that for the superstring, the critical dimension D = 10 
emerges as the result of an anomaly cancellation argument: each world- 
sheet boson X contributes cx = 1, each Majorana fermion Y” contributes 
Cy = 1/2, and thus ten copies of each are necessary in order to cancel the 
“background” central charge associated with the worldsheet superconfor- 
mal symmetry. But, even though we require ten bosons and ten fermions, 
there is no reason why we must endow all of them with Lorentz vector 
indices u. Since we are ultimately interested in four-dimensional string 
theories, one natural idea is to consider these ten bosons and ten fermions 
in two groups, four with indices = 0,1,2,3, and the remaining six with 
purely internal indices i = 1,...,6. This internal symmetry could then be 
interpreted as a gauge symmetry. 

This idea is in fact reminiscent of the original Kaluza-Klein idea whereby 
gauge symmetries are realized from higher-dimensional gravitational theo- 
ries upon compactification. Moreover, this idea does succeed in producing 
gauge bosons (and gauge symmetries) in dimensions D < 10. However, 
the problem is that this idea fails to produce enough gauge symmetry. 
Specifically, although we obtain gauge symmetries that are large enough to 
contain the Standard Model gauge symmetry SU(3) x SU(2) x U(1), we 
cannot obtain massless representations that simultaneously transform as 


*Despite this fact, Ramond-Ramond charge plays a crucial role in recent developments 
concerning string duality. While none of the states in the perturbative Type IIA string 
spectrum carry Ramond-Ramond charge, these strings also contain non-trivial solitonic 
states (so-called D-branes) which do carry Ramond-Ramond charge. We shall briefly 
discuss D-branes in Lecture #8. 
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triplets of SU(3) and doublets of SU(2). Such “quark” representations are 
required phenomenologically. Thus, even though this compactification idea 
is interesting as a way of generating certain amounts of gauge symmetry, it 
cannot be used in order to save the superstring. 

What we require, then, is a different way of introducing worldsheet 
fields without Lorentz vector indices. Since we will (temporarily) abandon 
the idea of removing Lorentz indices from our ten worldsheet bosons and 
fermions, what this means is that we require a way of obtaining even more 
worldsheet fields in ten dimensions. In other words, if we want bigger 
gauge symmetries in D = 4, then we require more than six extra fields with 
internal indices 7, which in turn means that we already want extra fields 
even in the original ten-dimensional interpretation. 

But how can we introduce extra worldsheet fields without violating our 
previous conformal anomaly cancellation arguments? Just adding extra 
fields will reintroduce the conformal anomaly at the quantum level. 


6.5.2. The heterotic string: Constructing the action 


The idea, of course, is to abandon the Type II string and proceed to con- 
struct a new kind of string that can accomplish the goal. This string is 
called the heterotic string, and it is this string that will be our focus for the 
remainder of these lectures. This string was first introduced by D. Gross, 
J. Harvey, E. Martinec, and R. Rohm in 1985, and for more than a decade 
dominated (and still continues to play a pivotal role in) discussions of string 
phenomenology. 
Let us begin by recalling the action of the bosonic string: 


fès {(0- XE)? + (04 X6}. (6.5.1) 


Here the worldsheet symmetry is simply conformal invariance, which re- 


Sbosonic ee ES 
Ara! 


quires that we take u = 0,1, ...,25 in order to cancel the conformal anomaly. 

Clearly, this action contains lots of worldsheet fields. However, we saw in 

Lecture #2 that this string does not give rise to spacetime fermions. 
Next, we considered the superstring, whose action is given by: 


1 
Ssuper = -z |e {(@_X%)? — WRO_WRy + (84 X6)? = Ytou} x 
(6.5.2) 
Here the worldsheet symmetry is superconformal invariance, which requires 
that we take u = 0,1,...,9 in order to cancel the superconformal anomaly. 
Unlike the bosonic string, this string gives rise to spacetime fermions. But 
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as we have just explained, this string does not contain enough worldsheet 
fields to give rise to appropriate gauge symmetries. 

Clearly, each of these strings has an advantage lacked by the other. The 
natural solution, then, is to attempt to “weld” them together, to “cross- 
breed” them in such a way as to retain the desirable attributes of each. But 
how can this be done? 

The fundamental observation is that we are always dealing with closed 
strings, and for closed strings, we have seen that the left- and right-moving 
modes are essentially independent of each other and form separate theories. 
Indeed, only the level-matching constraint Lo = Lo serves to relate these 
two halves to each other, but even this constraint applies at the level of the 
physical Fock space rather than the level of the action. Therefore, since 
these two halves are essentially independent, a natural idea is to construct 
a new hybrid string whose left-moving half is the left-moving half of the 
bosonic string, but whose right-moving half is the right-moving half of the 
superstring. As we shall see, this fundamental idea is just what we need. 
The resulting string is therefore called a heterotic string, where the prefex 
hetero- indicates the joining of two different things. 

Given this idea, let us now see how the action for the heterotic string 


can be constructed. We shall do this in three successive attempts. Our first 
attempt would be to write an action of the form 


1 


S = — 
4ra 


J d?o {(0-XH)? — WhO_wry + (04X#)2}. (6.5.3) 


In this case, the worldsheet symmetry would be conformal invariance for 
the left-movers, but superconformal invariance for the right-movers. 

But what is the spacetime dimension of such a string? If we consider 
the right-moving sector, then just as in the superstring we would require 
D = 10, so that u = 0,1,...,9. But given this, how do we interpret the left- 
moving side of the heterotic string? On the left-moving side, cancellation of 
the conformal (rather than superconformal) anomaly requires that we still 
retain 26 X fields! But if only ten of these fields are spacetime coordinates, 
then the remaining sixteen must be mere internal scalar fields. In other 
words, rather than carry the u index (which would imply that these X fields 
would transform as vectors under the spacetime Lorentz group SO(9,1)), 
these sixteen extra fields must instead carry a purely internal index i = 
1,...,16. So our second attempt at writing a heterotic string action would 
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result in an action of the form 


1 
4ra’ 


S = -goy | Po {(0-X4? - Whd-vru + XE? + (XL?) 

(6.5.4) 
where we have explicitly separated the left-moving bosons into two groups, 
with u = 0,1,...,9 and i = 1,...,16. 

But there still remains a subtlety. We cannot simply decide to remove 
the u index from the X fields and make no other changes, because these 
Xt fields would continue to have a mode-expansion of the form (6.2.9) with 
the u index replaced by an internal index i. While the interpretation of 
the oscillation exponential terms in (6.2.9) is not problematic, how would 
we interpret the “zero-mode” terms x’ + ¢?p'(a1 +2)? In the case of the 
spacetime coordinate fields X”, recall that these “zero-mode” quantities x’ 
and p? are interpreted as the center-of-mass position and momentum of the 
string. But for purely internal fields X*, this interpretation is problematic. 
To clarify this difficulty, let us consider the worldsheet energy oe) associ- 
ated with these degrees of freedom, as in (6.2.13). Just as in the case of the 
spacetime coordinates X“, these worldsheet energies for the X* fields would 
a priori take continuous values, thereby leading to a continuous spectrum 
even in D = 10. A continuous spectrum, of course, indicates nothing but 
the appearance of extra spacetime dimensions, so even though we may have 
replaced the index u with the index i, we have not really solved the funda- 
mental problem that there are too many uncompactified degrees of freedom 
amongst the left-movers. 

Therefore, we still must find a way to replace this continuous spectrum 
with a discrete one. Because the following discussion is slightly technical 
and outside the main line of the development of the heterotic string action, 
we shall separate it from the main flow of the text. The reader uninterested 
in the following details can skip them completely and proceed directly to 
the resumption of the main text. 


In order to eliminate this continuous spectrum, we must 
compactify these extra sixteen dimensions. This is anal- 
ogous to discretizing the continuous spectrum of a free 
particle (plane wave) by localizing it in a box. In the 
present case, we can choose to compactify each of these 
extra spacetime “coordinates” Xt on a circle of radius R;. 
What this means, operationally, is that we make the fol- 
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lowing topological identification in spacetime: 
X? <> X*+9nR; . (6.5.5) 


For simplicity (and as we shall see, without loss of gener- 
ality), we shall take R; = R for all i. Thus, rather than 
demand simple periodicity of the X* “coordinates” as in 
(6.2.6) as we traverse the closed string worldsheet, we must 
allow for the more general possibility 


X*(o,+7,02) = X*(o1,02)+20n;R, nic zZ (6.5.6) 


where the integer n; is called the “winding number”. The 
interpretation of this condition is that as we traverse the 
closed string once on the worldsheet (i.e., as o1 > o1 +7), 
the spacetime “coordinate” field _X* traverses the compact- 
ified spacetime circle n; times. In other words, the closed 
string “winds” around the it! compactified spacetime cir- 
cle n; times. Because of this compactification, we see that 
the momentum p’ is now quantized (as we would expect 
for any particle in a periodic box of length R), and is re- 
stricted to take the values pê = m;/R, m; € Z. Indeed, 
working out the most general mode-expansion consistent 
with (6.5.6), we find that a given such coordinate X* takes 
the form 


X(o1,02) = x + 2nRoı + Oso + oscillators , (6.5.7) 


where £ = vV2a’ is our fundamental length scale and 

where ‘oscillators’ generically denotes the higher frequency 

modes. This decomposes into left- and right-moving com- 

ponents 

/ 

XzR(o1 £02) = sat ca T nk) (o2+01)+oscillators i 
(6.5.8) 

Comparing (6.5.8) with (6.2.9) enables us to identify the 

left- and right-moving compactified momenta 


om nR 
pem Eg 
We would then simply keep Xz in our heterotic theory. 


Let us pause here to note an interesting phenomenon: 


(6.5.9) 


this mode-expansion is invariant under the simultaneous 
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exchange R > a’/R, m + n. This is a so-called T- 
duality. What this means is that unlike point particles, 
strings cannot distinguish between extremely large space- 
time compactification radii and extremely small spacetime 
compactification radii. Indeed, although the usual momen- 
tum m/R is extremely small in the first case and extremely 
large in the second, we see from the above mode-expansions 
that there is another contribution to the momentum, a 
“winding-mode momentum” nR/a’, which compensates 
by growing large in the first case and small in the sec- 
ond. Since there is no physical way of distinguishing be- 
tween these two types of momenta, the string spectrum is 
ultimately invariant under this T-duality symmetry. This 
duality underlies many of the unexpected physical proper- 
ties of strings relative to point particles, and has important 
(and still not well-understood) implications for string cos- 
mology. More importantly, however, this duality dramat- 
ically illustrates the breakdown of the traditional (field- 
theoretic) view of the linearly ordered progression of length 
scales and energy scales as we approach the string scale. 

Having succeeded in avoiding the consequences of a 
continuous momentum pł, our final question is the size of 
the radius R. It would certainly be aesthetically undesir- 
able if we were forced to incorporate a new, fundamental, 
unfixed parameter R into our string theory. Fortunately, 
it turns out that in D = 10, there are only a very re- 
stricted set of possibilities that lead to consistent theories, 
and these restrictions imply that we can restrict our at- 
tention to the simple case R = £ = V2a’ without loss of 
generality. Thus, we see that R can be taken to be at the 
string scale, and hence essentially unobservable to “low- 
energy” measurements. 

In order to see what is special about this radius, recall 
that the conformal anomaly contribution for each world- 
sheet boson is cx = 1, while the conformal anomaly con- 
tribution for each worldsheet Majorana (real) fermion is 
cy = 1/2. This suggests that the spectrum of a single 
compactified boson X might somehow be related to the 
spectrum of two Majorana fermions 71, 2, and this is in- 
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deed the case. Such a relation is typically referred to as a 
“boson-fermion equivalence” (which is possible in two di- 
mensions because the usual spin-statistics distinction be- 
tween bosons and fermions does not apply in two dimen- 
sions). In general, the spectrum of a compactified boson is 
identical to the spectrum of two Majorana fermions which 
are coupled to each other in a radius-dependent manner, 
and R = V2a’ is the only value of the radius for which 
this coupling vanishes. Thus, if X is compactified on a 
circle of radius R = V2a’, then the spectrum of quantum 
excitations of X is identical to the spectrum of quantum 
excitations of two free Majorana fermions %1, Y2 (or equiv- 
alently those of one complex fermion V = yı + iy2).4 In 
fact, at a mathematical level, it turns out that this equiv- 
alence takes the form of an actual equality between the 
product ıp and the partial derivative 0X. Note, how- 
ever, that while this specific radius is special from the point 
of view of boson/fermion equivalence, this is not the self- 
dual radius with respect to the T-duality transformation 
Read /R. 


The upshot, then, is that in the action (6.5.4), we are free to replace 
the worldsheet bosons X* (i = 1,...,16) with complex worldsheet fermions 
WV’ (i = 1,...,16). For ten-dimensional heterotic strings, we shall see that 
this replacement can be made without loss of generality. This replacement 
suffices to make the center-of-mass “momenta” associated with the X* fields 
discrete rather than continuous, as we require. Given this, the final action 
for the heterotic string takes the form: 


1 
4ra’ 


Sheterotic = 


i do {(O, Xf)? — UA, Uy + (O_ Xk)? — bRO_vRy} 
(6.5.10) 


tT We are again cheating slightly here. The rigorous statement is that we must compactify 
the X boson on a so-called Æ> orbifold with this radius in order for the spectrum of X 
to be identical to that of two free Majorana fermions. The equivalence between these 
bosonic and fermionic systems can be demonstrated explicitly at the level of their full 
underlying left/right two-dimensional conformal field theories. By contrast, compactify- 
ing X on a circle of this radius yields the spectrum of a single complex fermion, and the 
full left/right conformal field theory corresponding to a single complex fermion actually 
differs from that corresponding to two real fermions. These distinctions between circles 
and orbifolds, and likewise between a single complex fermion and two real fermions, will 
not be relevant for what follows. 
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where Wr are Majorana-Weyl (real) right-moving worldsheet fermions, 
where Yz are complex Weyl left-moving fermions, and where u = 0,1,...,9 
and i = 1,...,16. 


6.5.3. Quantizing the heterotic string 


The next step, then, is to quantize the worldsheet fields of the heterotic 
string. The quantization of the bosonic fields X“ and worldsheet Majorana 
fermions ~ was discussed in previous lectures, and does not change in 
this new setting. The only new feature, then, are the mode-expansion and 
quantization rules for the complex fermions Y$. 

Once again, there are two possible mode expansions for the left-moving 
complex fermions Y, depending on whether we choose Neveu-Schwarz (anti- 
periodic) or Ramond (periodic) boundary conditions.* In the case of anti- 
periodic boundary conditions, recall that our mode-expansion (6.3.12) for 
left-moving real (Majorana) fermions can be written in the form 


ploto) = X [per eaten) of Bl tinea) (6.5.11) 
r=1/2 


where we recall the hermiticity condition b_, = b}. Thus, for a left-moving 
complex fermion, our analogous mode-expansion takes the form 


W(orto2) = Y [bre irieste2) 4. diea] (6.5.12) 
r=1/2 


which of course implies 


toato) = X [ileti teata dg itatea] . (6.5.13) 
r=1/2 


For r > 0, b, destroys fermionic excitations and b} creates them, while 
d, destroys anti-fermionic excitations and di, creates them. Thus, as ex- 
pected, the only new feature is the presence of twice as many mode degrees 
of freedom, one set associated with fermionic excitations and the other 


tBecause there is no worldsheet supersymmetry that relates these left-moving fermions 
to corresponding left-moving bosons X, more general boundary conditions may actually 
be imposed in this case. However, for heterotic strings in ten dimensions, it turns out that 
we can restrict our attention to periodic or anti-periodic boundary conditions without 
loss of generality. Fermions with generalized worldsheet boundary conditions will be 
discussed further in Lecture #7. 
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with their anti-fermionic counterparts. These modes satisfy the usual anti- 
commutation relations 


{b}, bs} = {d}, ds} = Ors : (6.5.14) 


The corresponding number operator and worldsheet energy contributions 
are then given by 


N= X, (bbb, — aid.) 
r=1/2 

Io = X` r (old, +dld,) . (6.5.15) 
r=1/2 


Note that the anti-particle excitations subtract from the number operator 
yet add to the total energy. Finally, as expected, the vacuum energy con- 
tribution from each complex Neveu-Schwarz fermion is twice that for each 
real Neveu-Schwarz fermion: ay = 2a, = —1/24. 
The Ramond case, of course, is more subtle because of the zero-mode. 
It turns out that the complex-fermion mode-expansion is given by 
oO 
U(o, +02) = X [bne niote) 4 dt etim(ortea)) + bo 


n=1 
oo 


W'(o, +02) = Se beers) + der mer tea) 4. bi , (6.5.16) 


n=1 


with the anti-commutation relations 


In (6.5.16), we have explicitly separated out the zero-mode from the higher- 
frequency modes. The number operator and worldsheet energy conributions 
are given by 


N= X (bib. did.) + bibo 
r=1/2 
Lo= X` r (old, + did,) . (6.5.18) 
r=1/2 
Note that there is no worldsheet energy contribution from the zero-modes. 


Finally, the vacuum energy contribution from each complex Ramond 
fermion is twice that for each real Ramond fermion: ay = 2ay = +1/12. 
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One might wonder, at first, why there is no anti-particle zero-mode 
do. However, such an anti-particle zero-mode dg would be equivalent to 
the particle zero-mode bo. The easiest way to see this is to realize that 
ultimately (6.5.16) represents a Fourier-decomposition of the V(a1+0¢2) into 
different harmonic frequencies (exponentials). By its very nature, the zero- 
mode is the constant term in such a decomposition (since it corresponds to 
zero frequency), and this constant term is nothing but bo. However, there 
can only be one degree of freedom associated with a given constant term. 
Having an additional zero-mode dp would thus represent a redundant (non- 
independent) degree of freedom. Of course, whether we associate bo or do 
with the constant term is purely a matter of convention. 

Given this observation, we are finally in a position to explain our count- 
ing of zero-mode states in Lectures #3 and #4. Since there is only one 
zero-mode degree of freedom for each complex worldsheet fermion, there 
can really be only “half” a zero-mode for each real worldsheet fermion. 
This explains the footnote in Sec. 3.2, and also explains why (in the para- 
graph following (6.4.9)) we counted only one zero-mode excitation per pair 
of Majorana fermions. This also explains why, ultimately, the treatment of 
the Ramond zero-mode for a real worldsheet fermion is rather subtle: es- 
sentially we must take a “square root” of the complex Ramond zero-mode 
bo. There does exist a consistent method for taking this square root, but 
this is beyond the scope of these lectures. For our purposes, it will sim- 
ply be sufficient to recall that there is only one zero-mode state for each 
complex worldsheet fermion, or for each pair of real worldsheet fermions. 


6.6. Lecture #6: Some famous heterotic strings 


Our next step is to construct actual heterotic string models, just as we did 
for the superstring. This will be the subject of the present lecture. 


6.6.1. General overview 


Before plunging into details, it is worthwhile to consider the general fea- 
tures that will govern the construction of our heterotic string models. Re- 
call from the previous lecture that the worldsheet fields of the heterotic 
string in light-cone gauge consist of eight right-moving worldsheet bosons 
X$, eight left-moving worldsheet bosons X}, eight right-moving Majorana 
(real) worldsheet fermions ~, and sixteen left-moving complex worldsheet 
fermions Y$ (i = 1,..., 16). 
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The role of the right-moving fermions Y% is the same as in the su- 
perstring: if they have Neveu-Schwarz modings, the corresponding states 
are spacetime bosons, and if they Ramond modings, the corresponding 
states are spacetime fermions. Indeed, by properly stitching these sectors 
together, it may also be possible to obtain spacetime supersymmetry (as 
in the superstring). Note that unlike the superstring, however, these bo- 
son/fermion identifications hold regardless of the modings of the left-moving 
complex fermions V4. This is because only the right-moving fermions carry 
spacetime Lorentz indices yz, and hence only these fermions determine the 
representations of the spacetime Lorentz algebra. 

The role of the left-moving complex fermions Yi is analogous. Be- 
cause they carry internal indices rather than spacetime Lorentz indices, 
the symmetries they carry are also internal, and as we shall see, they can 
be interpreted as gauge symmetries. Indeed, these WV‘, fields are precisely 
the internal fields we were hoping to obtain in Sec. 5.1. When they have 
Neveu-Schwarz modings, these fermions provide “vectorial” (scalar, vector, 
tensor) representations of the internal gauge symmetry. When they have 
Ramond modings, by contrast, they provide “spinorial” representations of 
the internal gauge symmetry. Thus, we expect a rich gauge representation 
structure in these models as well. 

As with the superstring, different models can be constructed depending 
on how the different modings are joined together to form our set of underly- 
ing sectors, and how the corresponding GSO constraints are implemented. 
We shall construct explicit models below. But it is already apparent that 
the heterotic string contains all the ingredients we require for successful phe- 
nomenology. By choosing certain combinations of right-moving fermionic 
modings with left-moving fermionic modings, we can control which gauge- 
group representations are bosonic and which are fermionic. Moreover, by 
choosing the relative modings amongst the left-moving complex fermions, 
we can even control the gauge group that is ultimately produced. 


6.6.2. Sectors and GSO constraints 


Just as in the superstring, we begin the process of model-building by choos- 
ing an appropriate set of underlying sectors and corresponding GSO con- 
straints. Moreover, just as in the superstring, we know that preservation 
of the right-moving worldsheet supersymmetry (or equivalently spacetime 
Lorentz invariance) requires that we choose our eight right-moving fermions 
Yh to all have the same boundary condition in each sector. This implies 
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that we can, if we wish, combine these right-moving fermions to form four 
complex right-moving fermions which we can denote Y%. (We retain the 
index u to remind ourselves that these fields carry indices with respect to 
the spacetime Lorentz algebra, even though strictly speaking only the real 
fields carry such vectorial indices.) However, unlike the superstring, 
there is no longer any such restriction on the boundary conditions of the 
left-moving fermions Y4. Thus, there remains substantial freedom in choos- 
ing the boundary conditions of these left-moving fermions. Ultimately this 
choice becomes the choice of the gauge group for the particular model in 
question. 

In the next lecture, we shall provide a detailed discussion of the rules 
by which one can choose these boundary conditions and determine their 
associated GSO constraints. Therefore, for the time being, we shall simply 
restrict our attention to the sectors listed in Table 6.2. Note that the 
corresponding vacuum energies are also listed in Table 6.2. In order to 
compute these energies, we can continue to use the middle expression in 
(6.4.1) where we recall that ng and nys count the number of real worldsheet 
fermions. Thus, for complex fermions, these numbers are doubled. 


Table 6.2. Eight possible sectors for ten-dimensional het- 
erotic strings, numbered 1 through 8. Here ‘NS’ and ‘R’ 
respectively indicate Neveu-Schwarz (anti-periodic) and Ra- 
mond (periodic) boundary conditions for worldsheet fermions, 
and arg and ay respectively denote the corresponding right- 


and left-moving vacuum energies. 


Before proceeding further, we can immediately deduce some physical 
properties of the string states that would emerge in each sector. First, we 
see that Sector #1 is the only sector from which tachyons can possibly 
emerge. This is because the level-matching constraints prevent tachyons 
in any other sector (i.e., there is no other sector which for which both az 
and apr are negative). Second, we observe that Sectors #2 and #3 cannot 
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give rise to massless states. This again follows from the level-matching 
constraints, and implies that (for phenomenological purposes) we will not 
need to consider the states arising in these sectors. Finally, we observe that 
Sectors #1,3,5,6 give rise to spacetime bosons, while Sectors #2,4,7,8 give 
rise to spacetime fermions. 

In some sense, Sectors #1—4 are the direct analogues of the four possible 
sectors in Table 6.1 for the superstring. Thus, the heterotic models that 
result from these sectors will be the analogues of the Type 0 and Type II 
superstring models. However, the additional Sectors ##5-8 represent new 
sectors that arise only for heterotic strings. We hasten to add that these 
sectors are not unique, and others could equally well have been chosen. We 
will discuss these possibilities in the next lecture. 

The next issue we face is to determine which combinations of sectors 
form self-consistent sets. It turns out (following the rules to be discussed 
in Lecture #7) that there are three different possibilities: 


e Case A: we consider Sectors #1 and #2 by themselves; 
e Case B: we consider Sectors #1 through #4 by themselves; or 
e Case C: we consider all Sectors #1 through #8. 


For each of these cases, there is then a different set of GSO constraints for 
each sector. As we have seen in our discussion of the superstring, the more 
sectors we have in our model, the more GSO constraints there are in each 
sector. In particular, each time the number of sectors doubles, the number 
of GSO constraints in each sector increases by one. For completeness, 
Table 6.3 lists the GSO constraints that apply in each sector for each of 
these three cases. 

Once again, observe the pattern of the GSO constraints. In Case A, we 
have only Sectors #1 and #2, for which all right-moving and left-moving 
boundary conditions are identical. Thus, the GSO constraints that apply 
in Case A combine Nz and Np together. (Recall that since Nz R € Z, 
we can just as easily write the GSO constraint for Case A as Nz + Nr = 
odd.) When we move from Case A to Case B, we introduce two new sectors 
(Sectors #3 and #4 in Table 6.2) which “twist” the boundary conditions of 
the right-movers relative to those of the left-movers. This has the effect of 
introducing a new GSO constraint in each sector, one which distinguishes 
separately between Nz and Nr. Finally, when we move from Case B to 
Case C, we introduce four new sectors (Sectors #5 through #8) which 
introduce an additional “twist” that distinguishes between the first eight 
left-moving fermions ce g ® and the second eight left-moving fermions 
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which is sensitive only to © Nz = 5$; N®. This suggests (and we shall 
see explicitly in Lecture #7) that the set of sectors is deeply correlated with 
the set of GSO constraints that are applied in each sector: each new “twist” 
introduces both a new set of sectors and a new GSO constraint in each 
sector. The fact that we are considering only Ramond or Neveu-Schwarz 
boundary conditions for our left-moving complex fermions V4 means that 
each successive twist doubles the number of sectors and introduces one 
new GSO constraint in each sector. These are called Zə twists. If we 
were to consider more general “multi-periodic” boundary conditions for 
the left-moving fermions (which is possible because they are not related to 
the left-moving worldsheet bosons by worldsheet supersymmetry), then we 
could introduce so-called “higher-order” twists that would result in more 
complicated GSO constraints. However, it turns out that in ten dimensions, 
we lose no generality by restricting our attention to such Zə twists. 


6.6.3. Four ten-dimensional heterotic string models 


It is apparent from Table 6.3 that Case A and Case B each correspond 
to one heterotic string model, while Case C corresponds to two separate 
heterotic string models. Thus, the GSO constraints in Table 6.3 together 
give rise to four distinct heterotic string models. In the remainder of this 
lecture, we shall work out the physical properties of these four models. 


6.6.3.1. The non-supersymmetric SO(32) string 


Let us begin by considering Case A, which consists of only Sectors #1 and 
#2. Only Sector #1 (the so-called “NS-NS sector”) can contain massless 
states. As indicated in Table 6.1, the vacuum energy in this sector is 
(ar, aL) = (—1/2,-1). Thus, at the bare minimum, the level-matching 
constraint Lo = Lo forces us to excite at least a half-unit of energy on 
the left-moving side. This can be accomplished by exciting any of the 
left-moving half-unit fermionic modes, since in this sector the left-moving 
fermions all have Neveu-Schwarz boundary conditions and thus contain 
half-integer modings. This produces the 32 possible states 


lOr 8 bt 1 j2l0}z and |0)p @ d 1j2l0}z - (6.6.1) 


Note that these states also satisfy the single applicable GSO constraint 
Nz, — Nr = odd, so they remain in the spectrum. From (6.4.6), we see that 
these states are tachyonic with a’ M? = —2. 
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Table 6.3. 
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GSO constraints for each of the eight heterotic string sectors 


in Table 6.2. Here the notation (8) Nz; = soos N®) indicates the total 
left-moving number operator for only the first eight left-moving complex 
fermions. As before, the braces indicate different correlated choices of GSO 
projections, so that we simultaneously choose either the upper choice or 
the lower choice for all sets. 


Sector # 
1 


Case A 
Nz, — Nr = odd 


Nz — Nr = odd 


Case B 
Nz, — Nr = odd 
Ny = even 


Nz — Nr = odd 
Nz, = even 


Nz — Nr = odd 
Nz, = even 


Nz, — Nr = odd 
Ny = even 


Case C 
Nz — Nr = odd 
Ny = even 
(8) Nz = even 
Nz = NR = odd 
Nz, = even 
(8) Nz = even 
Nz arg NR => odd 
Nz, = even 


dd 
NEE { So} 


Nz — Nr = odd 
Nz = even 


‘odd 
Nr = { even \ 
(8) Nr = even 
Nz = NR = odd 


odd 
NDS ee 
(8) N; = even 
Nz — NR = odd 


odd 
Ne { even \ 


dd 
®Nı = { iat 


Further states are realized by exciting higher worldsheet modes. Be- 
cause our worldsheet modes are quantized in minimum half-integer steps, 
we see that the next excited states in this model are massless. These states 
come in two varieties: 


Bs 2l0)r @ a%,|0), 


(6.6.2) 
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and 

b j2 b i2 l0)z 
bij dij, (0) 7, 
dij b i2 l0); 
dija d ij l0); 


b! /10)n ® (6.6.3) 


In (6.6.2), we have excited the lowest mode of the left-moving worldsheet 
boson X/', whereas in (6.6.3) we have excited two of the lowest modes of 
the left-moving fermions pi, Note that it is possible to excite both the 
particle and anti-particle modes from the same fermion Yt, and thus there 
is no restriction that i # j. Also note that all of these states in (6.6.2) and 
(6.6.3) satisfy the GSO constraint Nz — Npr = odd. While Nr = 1 in all 
cases, we have Nz = 0 in (6.6.2) (since the number operators are defined 
not to include the contributions from worldsheet bosons), and Nz = 2 in 
(6.6.3). 

How do we interpret these states? Once again, the states (6.6.2) are eas- 
ily recognized as our gravity multiplet, consisting of the spin-two graviton 
guv, the spin-one anti-symmetric tensor B,,, and the spin-zero dilaton ¢. 
It is interesting to note that this state (6.6.2) is realized as a hybrid of the 
gravity multiplet state in the bosonic string (6.2.19) and in the superstring 
(6.4.7). This reflects the underlying construction of the heterotic string, 
and ensures that the heterotic string, like its predecessors, is also a theory 
of quantized gravity. Once again, the appearance of the gravity multiplet 
is a useful cross-check of the GSO constraints. 

The states in (6.6.3) have a different interpretation, however. Clearly, 
their Lorentz structure indicates that they are massless Lorentz vectors. 
Thus, they are to be interpreted as spacetime gauge bosons. Thus, we 
see that the heterotic string has succeeded in providing us with spacetime 
gauge symmetry, just as we had originally hoped. 

But what is the gauge group? Of course, the gauge group is ultimately 
determined from the i,j indices, and since (in Cases A and B) we have 
not destroyed the rotational symmetry in the space of the 16 complex left- 
moving fermions W*, (or the 32 real left-moving fermions into which they 
can be decomposed), we immediately suspect that the gauge symmetry 
should be SO(32). There are number of ways to deduce that this is correct. 
Perhaps the easiest way is simply to count the gauge boson states in (6.6.3). 
If we restrict our attention to the cases i Æ j, then there are (2-16)(2-15)/2 
states. The first factor (2-16) reflects the fact that for each of the 16 


String Theory, String Model-Building, and String Phenomenology 335 


possible choices of Y$, we can excite either the fermion or anti-fermion 
mode. The second factor (2-15) reflects the same set of options for the 
second fermion vi, and we divide by two as the interchange symmetry 
factor. There are also the cases with i = j: from such cases we obtain 16 
possible states, reflecting the 16 different fermions Y4, whose fermion and 
anti-fermion modes are jointly excited. The total number of states is then 


(2- 16)(2- 15) 
2 


Of course, the above counting method for determining the gauge group 
is hardly precise, for there are a number of gauge groups with the same 


+ 16 = 496 = dim S0(32) . (6.6.4) 


overall dimension (and we shall come across another such gauge group very 
soon). We therefore require a more sophisticated method which also gener- 
alizes to more complicated cases. By definition, of course, the gauge group 
can be determined by explicitly examining the charges of the gauge boson 
states and determining which Lie algebra (i.e., which root system) they 
fill out. We therefore need a way of determining the charges of the gauge 
boson states. Since our gauge symmetry is ultimately associated with the 
left-moving worldsheet fermions W‘,, the relevant current in this case is 
simply the worldsheet current Jt = Y$ Y}. From this, we can deduce the 
associated charge Q;. It turns out that 


Great Leap #7: The charge associated with each world- 
sheet fermion VW‘, for a given string state with fermionic ex- 
citation number N™ is given by Qi = N® +q;. Here qi; is 
a “background” charge which is 0 if U4, is a Neveu-Schwarz 
fermion and —1/2 if Y$ is a Ramond fermion. 


Given this result, we can easily deduce the gauge group for the case 
in question. For simplicity, let us first imagine that there are only two 
left-moving fermions gi? In this case, (6.6.3) reduces to six states: 


be 1/267 1/210), » bey y2d2 4/210), dŁ1j2b2172l0)z ; 
5794? 4/210), » br ayod4 4/210), B49? 42/0), - (6.6.5) 


For each of these states, there are two charges, Qı and Q2, associated with 
each of the two complex fermions. If we denote these states as A through F 
respectively, we can plot the charges of these six states as in Fig. 6.8. The 
resulting diagram is easily recognized as the root system (or equivalently 
the weight system of the adjoint representation) of the Lie group SO(4). 
Generalizing from two complex fermions to n complex fermions analogously 
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Fig. 6.8. The two-dimensional “charge lattice” associated with the six string states A 
through F in (6.6.5). Note that the two states E and F fill out the Cartan subalgebra 
of the root system. For a ten-dimensional heterotic string, the charge lattice is always 
sixteen-dimensional (generally implying a gauge group of rank 16), with a Cartan sub- 
algebra consisting of sixteen gauge boson states. 


yields the gauge group SO(2n), provided that all n complex fermions have 
the same modings. Thus, in the case of 16 complex fermions, we find the 
gauge group SO(32). 

Note that this argument suffices to show that the gauge bosons fill out 
the adjoint representation of SO(32). However, it does not demonstrate 
that all other string states in the model fall into representations of this gauge 
group. Of course, this is required for the consistency of the string. However, 
such a result can indeed be proven mathematically by constructing the 
current operators associated with the gauge group in question (as discussed 
above), and demonstrating that all states surviving the appropriate GSO 
constraints transform appropriately under these currents. For example, 
the 32 tachyonic states in (6.6.1) transform in the vector representation of 
SO(32), and the gravity multiplet (6.6.2) transforms as a singlet of SO(32) 
(as it must). However, a proof that this holds for all states in both the 
massless and massive string spectrum is beyond the scope of these lectures. 

We should also point out that what emerges in such closed string theo- 
ries is not simply the algebra associated the gauge symmetry in question, 
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but rather an infinite-dimensional extension (or “affinization”) of it. Such 
affine Lie algebras are discussed in Ginsparg (reference given at the end 
of Lecture #1), and play an important role in the consistency and phe- 
nomenology of such heterotic string theories. 

To summarize, then, we see that Case A results in a tachyonic string 
model with quantum gravity and SO(32) gauge symmetry. In addition 
to 32 scalar tachyons transforming in the vector representation of SO(32), 
this model contains massless gauge bosons transforming in the adjoint rep- 
resentation of SO(32) as well as the usual gravity multiplet. This non- 
supersymmetric SO(32) heterotic string model is the heterotic analogue of 
the Type 0 string models in Lecture #4. 


6.6.3.2. The supersymmetric SO(32) string 


Let us now proceed to Case B. In this case there are four sectors (#1 
through #4 in Table 6.2), and we must impose the GSO constraints listed 
in the second column of Table 6.3. 

Let us begin by considering the states from Sector #1. These are the 
same as those considered in Case A, except that we must now impose the ad- 
ditional GSO constraint Nz = even. This projects out the tachyonic states 
(6.6.1), but preserves the gravity multiplet as well as the gauge bosons. 

As we discussed previously, Sectors #2 and #3 contain no massless 
states. Therefore, all that remains is to consider the states from Sector #4. 
Here the vacuum energy is (ar,a) = (0,—1). The right-moving ground 
state in this sector is the Ramond zero-mode ground state, which we have 
previously denoted {b4}|0) r» and thus massless states are realized only 
through non-zero excitations of the left-movers. The possible states are 


a210); 

b j2 b 4/210), 
{0540} z 8 b ija @ ah |0) 7. (6.6.6) 

di ija b ij l0)z 

dijz dija |0);, - 


In each case, the GSO constraints imply that we can excite only an odd 
number of right-moving zero-modes. According to our previous conven- 
tions, this indicates that the right-moving ground state corresponds to the 
spacetime Lorentz spinor Sg (rather than the conjugate spinor Cs). 
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It is, by now, easy to interpret the states in (6.6.6). The first state 
provides the superpartner states to the gravity multiplet, and contains a 
gravitino. This implies that the model has spacetime supersymmetry. Like- 
wise, the remaining states correspond to the superpartners of the SO(32) 
gauge bosons, and contain the SO(32) gauginos. The chirality of these 
spinor states is fixed by the GSO constraint and the right-moving ground 
state Ss. 

Summarizing, we see that this model therefore consists of the following 
states. We shall describe these states using the notation R1@(R2; R3) where 
R,, Ro are representations of the spacetime Lorentz group, and where R3 
is a representation of the SO(32) gauge group. These states consist of 


Vs 8 (Vs;1), Vs @ (ljadj), Ss @(Vs;1), Sg @ (1;adj) , (6.6.7) 


where the first and third states form the N = 1 supergravity multiplet and 
the second and fourth states form the SO(32) gauge boson supermultiplet. 
Together these states can be written in the factorized form 


(Vs @ Ss) & {(Vs;1) @ (1; adj)} , (6.6.8) 


thereby explicitly exhibiting the supersymmetry Vg > Sg. 

This string is the famous supersymmetric SO(32) heterotic string. Al- 
though not directly relevant for string phenomenology, this string plays a 
vital role in recent developments in string duality (to be discussed briefly 
in Lecture #8). 


6.6.3.3. The SO(16) x SO(16) and Eg x Es strings 


Let us now proceed to Case C. As discussed in Sec. 6.2, this case differs 
from Case B because we have now “twisted” the second group of eight left- 
moving complex worldsheet fermions relative to the first set. A priori, it is 
easy to imagine that this twist will break the gauge symmetry SO(32) > 
SO(16) x SO(16). However, there a few surprises still in store for us. 

We begin in Sector #1, which previously gave rise to the states given 
in (6.6.2) and (6.6.3). Introducing the third GSO constraint © N; = 
ae N® = even does not affect the gravity multiplet (6.6.2), but has 
a drastic effect on the remaining gauge boson states. We now see that we 
cannot excite arbitrary combinations of (i,7) fermions; instead we must 
choose either (i, j) = 1,...,8 or (i,j) =9,...,16. In string-theory parlance, 
all of the other states have been “projected out of the spectrum”. It is in 
this manner that we remove gauge boson states and break gauge symme- 
tries in string theory. (There are other methods for doing this in string 
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theory, but this is the only method at tree-level.) It is easy to see (follow- 
ing the arguments given above) that the remaining gauge boson states fill 
out the adjoint representation of two copies of SO(16), and thus the gauge 
group is a priori SO(16) x SO(16). Therefore, we shall henceforth denote 
our string states in the notation Ri ® (Ro; R3, R4) where Ri, Rə are the 
representations of the Lorentz group from the right- and left-movers, and 
where R3, R4 are the representations with respect to the two gauge group 
factors of SO(16) respectively. Thus, we see that Sector #1 gives rise to 
the states 


Vs 9 (Vs; 1,1) ’ Vs ® (1; adj, 1) ’ Vs ® (1; 1, ad)j) ’ (6.6.9) 


where the first states form the gravity multiplet and the second and third 
states are the SO(16) x SO(16) gauge bosons. 

As before, Sectors #2 and #3 do not give rise to massless states. Let 
us now consider what happens in Sector #4. The states that previously 
emerged in Sector #4 are given in (6.6.6). We now must impose the remain- 


dd 
ing GSO constraint 8) N; = { ae i Let us consider each case separately. 


If we impose the odd choice, then the gravitino state in (6.6.6) is projected 
out of the spectrum, indicating that supersymmetry is broken. Likewise, we 
find that the gaugino states are also affected: we can now excite only those 
states for which i = 1,...,8 and j = 9,...,16. This spinor state transforms in 
the (16, 16) representation of SO(16) x SO(16) (i.e., as the vector-vector bi- 
fundamental). By contrast, if we impose the even choice, then the gravitino 
state in (6.6.6) remains in the spectrum, indicating that supersymmetry is 
preserved. Likewise, the gaugino states are affected only by the new re- 
quirement that either i, j = 1,...,8 or i, j = 9,...,16. Thus, the new GSO 
projection projects our SO(32) gauginos down to SO(16) x SO(16) gaugi- 
nos, as expected. Summarizing, we find that in the “even” case, the states 
from Sector #4 are 


Let us now consider Sector #5. As indicated in Table 6.2, in this sector 
the vacuum energy is (ar,a) = (—1/2,0) and the first eight left-moving 
complex fermions are Neveu-Schwarz while the second eight are Ramond. 
Choosing the “odd” GSO constraints projects all possible massless states 
out of the spectrum (because there is no simultaneous solution to all three 
GSO constraints in the “odd” case). By contrast, choosing the “even” GSO 
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constraints yields the states 
b /10)n ® {bo}10), (i =9,...,16) (6.6.11) 


where we must choose an even number of zero-mode excitations on the left- 
moving side. This produces a massless vector state which transforms in 
a (128-dimensional) spinorial representation of the second SO(16) gauge 
group factor. Following our previous conventions, we shall refer to this 
spinor as Cj2g rather than its conjugate S128. This state can therefore be 
denoted as 


Vs ® (1; 1, Cig) . (6.6.12) 


We shall discuss the physical interpretation of this state shortly. 

Sector #6 is similar to Sector #5, except that now the first eight left- 
moving complex fermions are Ramond and the second eight are Neveu- 
Schwarz. In a similar way we then find that there are no states in the 
“odd” case, while in the “even” case we find the states 


Vs © (1; Ci28, 1). (6.6.13) 


We now turn to Sector #7. Here the vacuum energy is (ar,a) = 
(0,0), which implies that if we restrict our attention to massless states, we 
can tolerate only zero-mode excitations amongst both the left- and right- 
movers. In the “odd” case, we find the states 


{bE 30) © {bh}|0), (i=9,..., 16) (6.6.14) 


where the GSO projections restrict us to an even number of zero-mode 
excitations on the right-moving side and an odd number on the left-moving 
side. According to our conventions, this produces the state Cg®(1; 1, S128). 
In the “even” case, by contrast, we are restricted to (6.6.14) where now we 
must have an even number of zero-mode excitations on the right-moving 
side and an odd number of the left-moving side. This produces the state 


Sg ® (1; 1, C128) . (6.6.15) 


Finally, in Sector #8, we similiarly find the states Cs ® (1; S128, 1) in 
the “odd” case and 


Ss ® (1; Cis, 1) (6.6.16) 


in the “even” case. 
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What are we to make of these results? Collecting our states for the 
“odd” case, we find a string model with the following massless spectrum: 


Vs @(Vs31,1), Vs @(lj;adj,1), Vs ® (1;1,adj) 
Ss ®(1;Vie, Vis), Cg ®(1;Si28,1), Cg @(1;1,Si2g). (6.6.17) 


This is clearly a non-supersymmetric spectrum consisting of a gravity mul- 
tiplet, vector bosons transforming of the adjoint of SO(16) x SO(16), one 
spinor transforming as a vector-vector bifundamental with respect to the 
gauge group, and two additional spinors of opposite chirality transform- 
ing in the spinor representations of the gauge group. This is the non- 
supersymmetric SO(16) x SO(16) heterotic string model, first constructed 
in 1986. Note that this spectrum configuration is anomaly-free, as required 
for a self-consistent string theory. Also note that this string is tachyon- 
free even though it is non-supersymmetric. This example thus proves that 
not all non-supersymmetric strings have tachyons (although it is certainly 
true that all supersymmetric strings lack tachyons). While this is the only 
non-supersymmetric tachyon-free heterotic string in ten dimensions, there 
exist a plethora of such strings in lower dimensions. We shall discuss some 
of the properties of such strings in Lecture #8, but this raises an interest- 
ing issue: Does string theory predict spacetime supersymmetry? As this 
example makes clear, string theory certainly does not predict spacetime 
supersymmetry on the basis of tachyon-avoidance. However, the general 
answer to this question is unknown. 

Even more interesting is the model that results in the “even” case. 
Collecting our states from (6.6.9), (6.6.10), (6.6.12), (6.6.13), (6.6.15), and 
(6.6.16), we find that the total massless spectrum of this string can be 
written in the factorized form 


(Vs 5s) $ {(Vs; 1, 1) @ (1; {adj © C128}, 1) (1; 1, {adj © C128})} . 

(6.6.18) 
The appearance of the right-moving factor V8 ®Ss indicates that this model 
has N = 1 supersymmetry, as expected from the appearance of a single 
gravitino in the massless spectrum. The left-moving factor, by contrast, 
contains three terms. The first term combines with the right-moving factor 
to produce the supergravity multiplet. The second two terms formerly 
gave rise to the SO(16) x SO(16) gauge supermultiplet. However, we now 
see that for each SO(16) gauge group factor, the massless vector states 
transform in the adj 6 C128 representation rather than simply in the adj 
representation. While the adj contribution is easy to interpret (giving rise 
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to the usual gauge bosons of SO(16)), the extra massless vector states 
transforming in the C128 representation of each gauge group factor appear 
to cause an inconsistency, for we know that all massless vector states must 
be interpreted as gauge bosons, and hence such states can only transform 
in the adjoint representation. Thus, the only possible way that this string 
can be consistent is if the massless vector states in this model somehow 
combine to fill out the adjoint representation of some other group G: 


adjsocie) ® Cizs = adjg . (6.6.19) 


Remarkably, this is precisely what occurs: the group G is nothing but the 
exceptional Lie group Es! Indeed, the 120 states of the adjoint represen- 
tation of SO(16) together with the 128 states of the spinor representation 
of SO(16) combine to produce the 248 states of the adjoint representation 
of Es! In string parlance, we thus say that the presence of the “twisted” 
states (6.6.12), (6.6.13), (6.6.15), and (6.6.16) has enhanced the total gauge 
group from SO(16) x SO(16) to Eg x Ex. This, then, is the famous super- 
symmetric Eg x Eg heterotic string. 

Unlike the supersymmetric SO(32) string, this string is generally consid- 
ered to have excellent phenomenological prospects. It has N = 1 spacetime 
supersymmetry, quantum gravity, and an Eg x Eg gauge symmetry. Eg 
is a compelling gauge group for phenomenology because it contains Ee as 
a subgroup, and Ee is a group that contains chiral representations which 
can be associated with grand unification and which thereby contain all of 
the particle content of the Standard Model. (Of course, it is still neces- 
sary to obtain actual matter representations from this string, but these can 
arise upon compactification.) Moreover, while we can imagine the Stan- 
dard Model to reside entirely within one of the Eg gauge group factors, 
the other factor may be interpreted as a “hidden” sector which can also 
have important phenomenological uses (such as triggering supersymmetry 
breaking, providing dark-matter candidates, and enforcing string selection 
rules). Thus, historically, much of the original work in string phenomenol- 
ogy began with a study of the compactification of this model down to four 
dimensions. However, it is possible to construct heterotic string models 
directly in four dimensions, and to obtain models which do not necessarily 
have an interpretation as arising via the compactification of any particular 
string model in ten dimensions. Thus, as we shall see, the prospects for 
phenomenological heterotic string model-building are broader than merely 
studying the compactifications of the Eg x Eg heterotic string. 
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6.6.4. More ten-dimensional heterotic strings 


So far, we have constructed four heterotic string models in ten dimensions. 
Of these, two have spacetime supersymmetry, and two do not. However, it 
is readily apparent that further models can be constructed by introducing 
further “twists” which further enlarge the set of sectors in Table 6.2 and 
which further break the gauge group into smaller factors (or which break 
the original SO(32) gauge group in entirely different ways). The question 
that arises, then, is whether there exist other ten-dimensional heterotic 
strings with spacetime supersymmetry, or whether there exist other non- 
supersymmetric strings in ten dimensions that are tachyon-free. The answer 
to both questions turns out to be “no”. A complete list of ten-dimensional 
heterotic strings is given in Table 6.4. 


Table 6.4. The complete set of ten-dimensional heterotic 
string models. Two have spacetime supersymmetry, one is 
non-supersymmetric but tachyon-free, and the remaining six 
are non-supersymmetric and tachyonic. 


gauge group spacetime SUSY? | tachyon-free? 
SO(32) yes yes 
Eg x Eg yes yes 
SO(16) x SO(16) no yes 
SO(32) no no 
SO(16) x Eg no no 
SO(8) x SO(24) no no 
(E7)? x [SU(2)]? no no 
U(16) no no 
Eg no no 


The presence of the last string in Table 6.4 might seem surprising. After 
all, the rank of the gauge group for this string is only eight rather than 
sixteen, which implies that its construction must differ substantially from 
that of the previous strings. It turns out that this is indeed the case.* We 
briefly indicate in Lecture #7 how such strings may be constructed. 


*Unlike the other ten-dimensional heterotic strings, this string involves splitting each 
complex worldsheet fermion into a pair of two real worldsheet fermions and then intro- 
ducing relative “twists” within each pair. In technical language, this results in a gauge 
group whose rank is reduced but whose so-called affine level is increased relative to 
those of the other strings. This increase in the affine level is important for string GUT 
model-building, and will be discussed in subsequent lectures. 
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6.7. Lecture #7: Rules for string model-building 


In the last several lectures, we constructed many different string models. 
Amongst the superstring models, we constructed the Type 0A, Type OB, 
Type IIA, and Type IIB models, while amongst the heterotic string 
models, we constructed the non-supersymmetric SO(32) model, the non- 
supersymmetric SO(16) x SO(16) model, and the supersymmetric SO(32) 
and Eg x Eg models. In each case, we simply asserted a set of sectors 
(combinations of Neveu-Schwarz and Ramond modings) and a set of GSO 
constraints in each sector. Of course, each of these sets of sectors and GSO 
constraints conspires to yield a self-consistent string model, and occasion- 
ally it is even possible to see intuitively which choices can lead to self- 
consistent string models. However, we ultimately wish to construct semi- 
realistic string models where the groups are broken down to much smaller 
pieces than we have been dealing with thus far (e.g., SU(3) x SU(2) x U(1), 
or even SU(5) or SO(10)), and this is going to require more complicated 
twists than we have thus far been using. Furthermore, all of our string 
models thus far have been in ten dimensions, yet we are ultimately going 
to wish to compactify our string models to four dimensions. It turns out 
that this will introduce even further choices for modings, twists, and their 
associated GSO projections. (In geometric language, these further choices 
amount the choice of compactification manifold.) 

The question that arises, then, is to determine the minimal set of param- 
eters that govern these choices. What we require is a way to systematize the 
whole process of string model-construction, so that we will know precisely 
which choices govern the construction of a string model and guarantee its 
internal self-consistency. In other words, we require rules for string model- 
building. This is the subject of the present lecture. 

Once we learn the rules for the construction of ten-dimensional string 
models, it will be relatively straightforward to generalize these rules for the 
construction of models in four dimensions. We will then have the tools 
whereby we may finally construct semi-realistic four-dimensional string 
models. 


6.7.1. Generating the sector combinations: The 20-dimensional 
lattice 


The first issue we face is that of choosing the appropriate sector combi- 
nations. For example, let us recall the possible heterotic string sectors in 
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Table 6.2. As we discussed in Sec. 6.2, this set of sectors permits only three 
distinct sector combinations: either we choose Sectors #1 and #2 only, or 
we choose Sectors #1 through #4 only, or we choose Sectors #1 through 
#8. How can we know which combinations are allowed, and which sectors 
are required in each grouping? In Sec. 6.2, we discussed how modular in- 
variance ultimately governs these choices. Here, however, we shall develop 
a rule which we can use in order to deduce these sector combinations rather 
quickly and which can easily be generalized to more complicated situtions. 

First, let us introduce some notation. Since it is rather awkward to con- 
sider left-moving complex fermions Y% at the same time as right-moving 
real (Majorana) fermions wp, let us “complexify” our right-moving Majo- 
rana fermions so that all of our worldsheet fermions are complex. This 
means that instead of having eight left-moving real fermions ~ in light- 
cone gauge, we have instead four complex ones 4 formed by pairing the 
left-moving real fermions in groups of two. (We retain the index p to re- 
mind ourselves that these fields carry indices with respect to the spacetime 
Lorentz algebra, even though strictly speaking it is only their real compo- 
nent fields ~ that carry such vectorial indices.) 

We also need a more general notation for discussing the possible bound- 
ary conditions and modings that any such complex worldsheet fermion can 
take. In general, we can parametrize any possible worldsheet boundary 
condition in the form 


W(o,+7,02) = — e? (04, 02) (6.7.1) 


where -4 <v< Z. Thus the quantity v parametrizes the boundary 


condition of the individual fermion, with 
v=0: anti-periodic (Neveu-Schwarz) 
v= —1/2: periodic (Ramond) . (6.7.2) 


General values of v correspond to so-called “multi-periodic fermions”. 
For example, the general moding of a multi-periodic left-moving complex 
fermion is given by 


Ur (o1 + 02) = b aosi ae AEk) 
n=1 
+4 Gh gape Le MIe2 (6.7.3) 


and the corresponding number operator and worldsheet energy are defined 
accordingly. Note that these modings generalize those given in Sec. 5.3. 
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Likewise, the vacuum energy contribution from such a fermion is given by 


ie | 


This too generalizes our previous results. 

In ten dimensions, it turns out that we lose no generality by consid- 
ering only the specific cases v = 0, —4 for all worldsheet fermions. What 
this means is that all self-consistent ten-dimensional string models can ul- 
timately be realized using worldsheet fermions with only Neveu-Schwarz 
or Ramond boundary conditions. In lower dimensions, by contrast, other 
choices are possible. Therefore, even though we shall primarily focus our 
attention on the cases v € {0,—4}, we shall develop our formalism in such 
a way that it holds for arbitrary values of v. 

Given this parametrization, we can describe the boundary conditions 
within any sector rather succinctly by specifying twenty v-values, four for 
the complex right-movers Y% and sixteen for the complex left-movers vi. 
We can group these twenty v-values to form a “boundary-condition” vector 


vV = (01, Be, U3, V4 | v1, ..., viel 5 (6.7.5) 


and thus we may associate a vector with each underyling string sector. For 
example, the sectors in Table 6.2 now correspond to the vectors shown in 
Table 6.5. Note that in Table 6.5, we have used a shorthand notation in 
which superscripts indicate repeated components. We have also dropped 
the minus signs from the Ramond entries v = — 4. We stress, however, that 
even though we shall no longer explicitly indicate the Ramond minus sign, 
it should continue to be implicitly understood for all Ramond boundary 
conditions. (This minus sign can play an important role for string models 
in lower dimensions. ) 

What, then, are the self-consistent combinations of sectors? Recall from 
the previous lecture that the first self-consistent combination of sectors 
comprises Sectors #1 and #2 only. Let us therefore study this simplest 
combination. Sector #1 (the so-called NS-NS sector) corresponds to the 
zero-vector 0, the vector whose entries all vanish. Thus, in this sense, we 
might associate the NS-NS sector with the origin in a twenty-dimensional 
vector space. Sector #2 (the so-called Ramond-Ramond sector) then cor- 
responds to some other point in the vector space which is some distance 
away from the origin. Let us call this other location Vo = [(3)*|($)"9]. 

If we were to consider Vo to be a lattice basis vector, a natural question 
would be to determine the lattice that is generated by this basis vector. 


String Theory, String Model-Building, and String Phenomenology 347 


Table 6.5. The eight possible sec- 
tors for ten-dimensional heterotic 
strings from Table 6.2, written in 
the boundary-condition vector nota- 
tion of (6.7.5). Here the superscripts 
indicate repeated components, and 
we have dropped the minus sign for 
Ramond boundary conditions. 


Sector # vV 


Sj= 
I= 
peen 
n e 
ol— 
S|] ~ 


i 
0 
~n 


[aan 
Ie ONIH 


Because there is only one such non-zero vector, this would clearly be a 
one-dimensional “lattice”. Since Vo = [(4)*|(4)*6], the next point in the 
lattice would be 2Vo = [(1)*|(1)'®]. How can we interpret this point? 
Recall from (6.7.1) that the components of such vectors (i.e., the values of 
v) are defined only modulo 1 (i.e., they are restricted to the unit interval 
—4 <u< z). Thus, we see that v = 1 is physically the same as v = 0, once 
again implying a Neveu-Schwarz boundary condition. In other words, we 
should only add our vectors modulo 1. Given this, we find that 2Vo + 0, 
where we have introduced the notation = to indicate equality modulo 
1. Likewise, 3Vo = Vo, and so forth. Thus, we see that Vo generates a 
“lattice” consisting of only two physically distinct “points”: 


{0, Vo} . (6.7.6) 


However, these are precisely the two “points” that comprised our first self- 
consistent set of sectors (Case A in Lecture #6), and which led to our first 
string model! 

It turns out that this is a general property: All self-consistent choices 
of string sectors are those that correspond to the “points” in a twenty- 
dimensional lattice generated by a set of basis vectors. To illustrate this 
principle, let us consider the next case (Case B in Lecture #6). In this 
case, we included only Sectors #1 through #4. This indicates that we need 
a larger lattice, which in turn implies the existence of not just the single 
lattice-generating basis vector Vo, but also an additional basis vector V1. 
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One choice is: 

Vo = [(5)*1(3)""I 

Vi = [(0)*| Gs]. (6.7.7) 
Using these choices, we can see that indeed all four of these sectors can 
be generated as the different “points” in the resulting lattice: Sector #1 
corresponds to the origin 0, Sector #2 corresponds to Vo itself, Sector #3 
corresponds to V; itself, and Sector #4 corresponds to the remaining lat- 
tice point Vo + Vi. Note that no other points exist in this lattice, since 
2Vo = 2Vi +0.T hus, we see that the introduction of the additional basis 
vector V, is physically equivalent to the “twist” that shifts the boundary 
conditions of the left-moving fermions relative to those of the right-moving 
fermions in Sectors #3 and #4. 

Finally, let us consider the full set (Case C) consisting of Sectors #1 

through #8. It is easy to see that this set is generated by the three basis 
vectors: 


Vo = [(5)71 (9) 
Vi = [(0)*1()""] 
V2 = [(0)*](3)°(0)*] - (6.7.8) 


Once again, the introduction of the new basis vector V2 implements the 
“twist” that separates the boundary conditions of the first set of eight left- 
moving fermions from those of the second set. 

This procedure can be continued. Each additional basis vector intro- 
duces a new twist, increases the size of the resulting lattice, and leads 
to the introduction of new physical string sectors (so-called “twisted sec- 
tors”). For example, one further basis vector that might be introduced is 
V3 = [(0)4|($)*(0)4($)4(0)4].. This vector would have the effect of intro- 
ducing a further twist amongst the left-moving fermions within each group 
of eight. 

Clearly, given a set of N basis vectors V; (i = 0,..., N—1), the procedure 
for generating the full set of resulting string sectors is to consider all possible 
lattice vectors es a;V; where a; € {0,1}. Note that this restriction on 
the values of a; assumes that we are considering only Neveu-Schwarz or 
Ramond boundary conditions for the worldsheet fermions; generalizations 
to multi-periodic fermions will be discussed shortly. We shall henceforth 
denote a given string sector as aV = )7,a;V;. For example, the NS- 
NS sector (i.e., Sector #1) always corresponds to a = (0,0,...) and the 
Ramond-Ramond sector (i.e., Sector #2) corresponds to a = (1,0,...). 
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At this stage, we now know how to generate the full set of underlying 
string sectors once we are given a “primordial” set of basis vectors Vj. 
The next issue that arises is to determine the rules that govern the allowed 
choices of these basis vectors. Of course, we have already derived one such 
rule: each basis vector V; must take the form 


Vi = [(v)* | v1, «5 v16] (6.7.9) 


where the right-moving fermions all have same moding v € {0, —3}. In- 
deed, as we saw in Lectures #5 and #6, this requirement is necessary for 
the preservation of the right-moving worldsheet supersymmetry (so that the 
right-moving worldsheet supercurrent has a unique moding in each sector). 
This is also necessary for the preservation of spacetime Lorentz invariance, 
since the right-moving worldsheet fermions carry Lorentz spacetime indices. 

As we might expect, there are still several additional conditions that our 
basis vectors V; must satisfy. But before we can discuss these conditions, 
we must turn to the generation of the GSO constraints in each sector. 


6.7.2. Generating the GSO constraints 


We have already seen in previous lectures that the appearance of new string 
sectors is correlated with the appearance of new GSO constraints in each 
sector. We are now in a position to formulate this correlation more pre- 
cisely: in each string sector, there is one GSO projection for each basis 
vector. Our task, then, is to find a simple way to generate the exact forms 
of these GSO projections. 

Let us return to Case A, and consider the model consisting of only 
Sectors #1 and #2. As we have seen above, this model is generated by 
the single basis vector Vo = [(4)*|(4)1°], resulting in the two sectors 0 
(Sector #1) and Vo (Sector #2). In each of these sectors, recall from 
Table 6.3 that we then had the single GSO constraint Nz — Nr = odd, or 
equivalently 


16 4 
ST NO — YONO = odd. (6.7.10) 
i=l 


j=l 


(Here we have used the j-index to span our four complex right-moving 
fermions, while the i-index spans our sixteen complex left-moving fermions.) 
It is this GSO constraint that we now wish to write in a more transparent 
manner. 
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Given our success in using the lattice idea and modular arithmetic in 
order to generate the complete set of string sectors, let us attempt to write 
(6.7.10) in a form that makes use of both ideas. Let us first concentrate on 
the modular arithmetic idea. Since all of our basis vectors are defined only 
modulo one, let us cast (6.7.10) into the form of a modulo-one relation. 
Since (6.7.10) is already a modulo-two relation, this can be achieved by 
dividing by two: 

16 


Ly > NO — 157 N@ =1 (6.7.11) 


i=1 j=l 


Nie 


where we have used the notation = to indicate equality modulo 1. 
Let us now try to incorporate the lattice idea. To do this, let us make 
a vector out of our twenty number operators: 


N = (WY, NO, NO WO) NO, NO] | (6.7.12) 


Clearly, each different possible string state in a given sector corresponds 
to a different N-vector, and the physical (surviving) string states are those 
satisfying (6.7.11). Let us now attempt to write (6.7.11) in a vector no- 
tation. Neglecting the minus sign in (6.7.11) for the moment, we see that 
(6.7.11) involves a sum of vector components, which reminds us of a vector 
dot product. Thus, if we define the “signature” of our twenty-dimensional 
lattice to be [(—)*| (+)!6], we can write (6.7.11) in the form of a vector dot 
product: 


GHN = 3 (6.7.13) 


where we have introduced a vector each of whose components is equal to 
Z. However, this vector is nothing but Vo, the basis vector that generates 
the lattice for this model! Thus, we see that if our model is generated by 
the basis vector Vo, then in each of the resulting sectors {0, Vo} the GSO 
projections take the form 


1 


Vo- N = (6.7.14) 


1 

2 
This produces the non-supersymmetric SO(32) string model from Lec- 
ture #6! 

Let us now consider Case B, consisting of Sectors #1 through #4. As 
we saw in Lecture #6, this produces the supersymmetric SO(32) heterotic 
string model, and is generated by the set of two basis vectors given in 
(6.7.7). In each of the four resulting sectors {0, Vo, V1, Vo + Vi}, the two 
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GSO projections were Nz — Nr = odd and Nz = even. (Recall Table 6.3.) 
These now take the form 


VoN=4, Wi-N= 4. (6.7.15) 


Similarly, Case C is generated by the three basis vectors in (6.7.8), and the 
three GSO constraints in each sector take the general form 


Nr- Nrs., Nrs., ONp=... (6.7.16) 


Here ©) Nz = $; N®, and we shall momentarily defer a discussion of 
the values of the right sides of these constraint equations. We then find 
that these three GSO constraints take the general forms 

VoN ża, Vi-N=..., Vo-N =... (6.7.17) 
Depending on the right sides of these equations, this generates either the 
supersymmetric Es x Eg string or the non-supersymmetric SO(16) x SO(16) 
string. 

The final question, then, is to determine what appears on the right 
sides of these GSO constraint equations. In general, this will be some value 
x which satisfies —4 <a< L, This z-value is called a GSO projection 
phase, and is generally different for each sector. Thus, we know that x 
must itself depend on a, where (as discussed in Sec. 7.1) a parametrizes 
the particular sector in question. We also know from our prior experience 
(in particular, from Table 6.3) that x must also contain some additional free 
parameters because we occasionally still had the freedom to make choices 
such as { evenodd } when constructing our GSO constraints. 

It turns out the final result is the following. Within any given string 
sector aV = a a; V;, the states that survive are those whose number 
operator vectors N satisfy the equations 


N-1 
ViN = So kyo; + si — Vi (aV), OSi< N-1. (6.7.18) 
j=0 


This is therefore the full set of GSO constraint equations for the sector 
aV. In (6.7.18), the notation is as follows. There are N different equations 
here, depending on the value of i. In the last term, the dot product V; - 
(aV) is the dot product between V; and the sector aV for which the GSO 
constraint is being applied. In the second-to-last term, s; is defined as 
the first component (i.e., the first of the right-moving components) of the 
vector V;: 


s = Ve: (6.7.19) 


352 Keith R. Dienes 


Thus s; parametrizes the spacetime statistics of the sector V;, with s; = 0 
indicating spacetime bosons and s; = —} indicating spacetime fermions. 
Likewise, the sum > a;s; (mod 1) indicates the statistics of the sector 
aV. In the remaining term, kj; denotes a certain N x N matrix of num- 
bers (so-called GSO projection phases) satisfying —4 < kij < 4. These 
are therefore the remaining degrees of freedom that enter into our GSO 
constraints. In the case of Zə twists (for which all fermionic boundary 
conditions have either Neveu-Schwarz or Ramond boundary conditions), 
one has kj; € {0, —3} only. The case of multi-periodic fermions will be 
discussed shortly. 

Thus, if we are given a set of parameters {'V;, kij}, we can now generate 
the resulting string model and the entire corresponding spectrum! These 
parameters are ultimately the parameters that physically describe a given 
string model. 


6.7.3. Self-consistency constraints 


We finally turn to the remaining question: what determines how the pa- 
rameters {V;, kij} are to be chosen? What are the rules that guarantee a 
self-consistent choice? 

Clearly, as we have discussed earlier, modular invariance is one of 
many symmetries that govern these choices. Other requirements for self- 
consistency include proper spacetime spin-statistics relations (so that all 
Ramond states are indeed anti-commuting spacetime fermions, and all 
Neveu-Schwarz states are commuting spacetime bosons) and physically 
sensible GSO projections (so that unitarity is not violated, among other 
things). It is important to stress that these are not additional constraints 
that need to be imposed in order to guarantee the consistency of the string 
in spacetime; rather these constraints are intrinsic to string theory itself 
at the worldsheet level, emerging as string self-consistency constraints, and 
together imply these features in spacetime. 

We have already discussed the first contraint that governs the choices of 
the basis vectors: they must all have the form (6.7.9), with all right-moving 
fermions sharing the same boundary condition. Second, these vectors must 
all be linearly independent with respect to addition (modulo 1); otherwise, 
at least one of these vectors is redundant. The third constraint also turns 
out to be quite simple: among our set of basis vectors, we must always start 
with the vector 


Vo = [(3)*1G)"l. (6.7.20) 
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The presence of this vector ensures that the resulting string model contains 
at least a Ramond-Ramond sector in addition to a NS-NS sector. 
The remaining constraints serve to correlate the V; vectors with the 
GSO projection phases kij, and take the form: 
ki; thy = Vi- Vj 


kis + kio = Vi Vi- si. (6.7.21) 


Note that given a set of boundary condition vectors V;, the constraints 
(6.7.21) imply that only the elements k;; with i > j are independent param- 
eters. The first equation in (6.7.21) then enables us to uniquely determine 
kij with i < j, and the second equation in (6.7.21) enables us to uniquely 
determine the diagonal elements kii. 


6.7.4. Summary, examples, and generalizations 


Let us now summarize the rules for heterotic string model-building in D = 
10. We begin by choosing a set of linearly independent basis vectors V; 
(i = 0,...,N — 1) and a corresponding matrix of GSO projection phases 
kij (i,j = 0,..., N — 1). Our set of basis vectors may be as large as we 
desire; since each vector corresponds to an additional twist, larger sets of 
vectors lead to more complicated string models. Among our choice of basis 
vectors must always appear the vector Vo defined in (6.7.20), and every 
basis vector is required to have the form (6.7.9). We must also ensure that 
our choices of basis vectors V; and GSO projection phases kij are properly 
correlated according to (6.7.21). If there does not exist a solution for kij, 
then our original choice of V; must be discarded or repaired. These are the 
only constraints that govern the choices of the parameters {V;, kij }. 

Given such a self-consistent choice of parameters {V;, kij}, we are 
then guaranteed to have a self-consistent string model. The different sec- 
tors of this model are generated as all combinations D a;V; that fill 
out the twenty-dimensional lattice, where a; € {0,1}. In each sector 
aV = D a;V;, the allowed states are then those whose number operator 
vectors N simultaneously satisfy the constraints (6.7.18) for i = 0,..., N— 1. 
This is often called the spectrum-generating formula. 

It is straightforward to see how this formalism can be applied in practice. 
We shall leave it as an exercise to verify that the choice 


Vo = [(3)*1G3)""1, koo = (0) (6.7.22) 
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generates the non-supersymmetric S'O(32) heterotic string model; that the 
choice 


Vi = 10" 1G)" 0:0 


K OT pe Ge (6.7.23) 


generates the supersymmetric SO(32) heterotic string model; and that the 
choices 


Vo = (l Gs] 000 
Vi = [(0)*|(4)*9] kj = | 00k (6.7.24) 
V2 = [(0)*| (3) (0)"] ony 


generate the supersymmetric Eg x Eg string model if we choose k = 0, 
and the non-supersymmetric SO(16) x SO(16) string model if we choose 
k = 1/2. Indeed, it is a general property that if we choose our vector Vı 
as above, then spacetime supersymmetry is preserved if kio = kj for all 
i = 0,1,..., N — 1, and broken otherwise. Thus, we see that we now have 
a very compact notation and procedure for generating and analyzing ten- 
dimensional heterotic string models! We should also stress that these are 
not the only parameter choices of {V;, kij} that will lead to these mod- 
els. In fact, there is often a great redundancy in this procedure, so that 
a given physical string model can have many different representations in 
terms of the worldsheet parameters {V;, kij}. However, a given set of pa- 
rameters always corresponds to a single, unique, self-consistent string model 
in spacetime. 

The formalism that we have presented in this lecture is called the 
“free-fermionic construction”, and was developed in 1986 by H. Kawai, 
D.C. Lewellen, and S.-H.H. Tye and by I. Antoniadis, C. Bachas, and 
C. Kounnas. The name stems from the fact that the fundamental degrees 
of freedom on the string worldsheet (in addition to the spacetime coordi- 
nate fields X#) are taken to be the free fermionic fields UV. Even though 
we have presented this formalism for the case of ten-dimensional heterotic 
strings, there also exists a straightforward generalization of this formalism 
to four-dimensional heterotic string models. 

As we have indicated, this formalism also carries over directly to the 
case of multi-periodic complex fermions for which the boundary condition 
parameter v in (6.7.1) can be an arbitrary rational number in the range 
—$ <u< Ł, For each resulting boundary-condition vector V;, let us define 
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m; to be the smallest integer such that if we multiply each element in V; 
by m;, we obtain a vector of integer entries. In general, m; is called the 
“order” of the vector V;, and is also the order of the corresponding physical 
twist introduced by that vector. For example, in the case of only Neveu- 
Schwarz or Ramond fermions, we have m; = 2 for all i, implying only Ze 
twists. Nevertheless, even for general multi-periodic boundary conditions, 
the above constraints continue to apply exactly as written. Indeed, the 
only small change is that we now must take a; € {0,1,...,.m; — 1} when 
generating our lattice of corresponding string sectors. Likewise, each GSO 
projection phase k;; must now also be chosen such that mjkij € Z. 

In this regard, it is important to note that the only fermions which can 
possibly have such generalized boundary conditions are those which are not 
the worldsheet superpartners of worldsheet bosons. This restriction arises 
because the structure of the worldsheet supersymmetry algebra itself re- 
stricts the corresponding fermions to have only Neveu-Schwarz or Ramond 
boundary conditions. For example, in the case of the ten-dimensional het- 
erotic string, only the left-moving worldsheet fermions are a priori permit- 
ted to have generalized boundary conditions. By contrast, the right-moving 
fermions are restricted by the right-moving worldsheet supersymmetry al- 
gebra to have either Neveu-Schwarz or Ramond boundary conditions. This 
in turn implies that s; € {0, — 4}, so that a given string sector continues to 
give rise to only spacetime bosons or spacetime fermions. Also note that al- 
though we are capable in principle of utilizing multi-periodic fermions while 
constructing ten-dimensional heterotic string models, in practice it turns 
out that this does not lead to new models which are physically distinct from 
those using only Ramond or Neveu-Schwarz fermions. It is for this reason 
that we can ultimately restrict ourselves to these simpler boundary condi- 
tions in ten dimensions without loss of generality. In lower dimensions, by 
contrast, this is no longer true, and the number of possible models grows 
dramatically. 

This formalism can also be carried over to the case of ten-dimensional su- 
perstrings (rather than heterotic strings). For superstrings, the boundary- 
condition vectors take the simpler form 


Vi = [()* 1)" (6.7.25) 


where v,v € {0,5}. Our mandatory vector Vo then takes the form 
[($)*|(4)4], and we define s; = v+0 (mod 1) as our new spacetime statistics 


parameter, replacing (6.7.19). The results (6.7.21) and (6.7.18) then con- 
tinue to apply directly. Of course, this formalism is fairly trivial in the case 
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of ten-dimensional superstrings, for the maximal set of linearly independent 
basis vectors of the form (6.7.25) consists of only Vo and V: = [(0)4|($)4]. 
As we have seen in Lecture #4, this results in only four distinct super- 
string models in ten dimensions: omitting Vi from our basis set generates 
the Type 0 models, while including V; in our basis set generates the Type II 
models. However, just as for the heterotic strings, this formalism can also 
be generalized to the case of four-dimensional superstring models where the 
possibilities become much richer. 

It turns out that the free-fermionic formalism can be extended still fur- 
ther. For example, one can also extend this formalism to compactifications 
of the bosonic string. Moreover, one can even extend this formalism to 
special types of superstring and heterotic string models whose worldsheet 
actions must be represented in terms of real rather than complex fermions. 
Likewise, there even exist generalizations to string models involving non- 
free worldsheet fermions (7.e., models whose worldsheet actions involve ad- 
ditional Thirring-type interactions between the worldsheet fermions). In 
fact, even though there exist alternative model-construction formalisms 
that do not involve free worldsheet fermions at all, the free-fermionic con- 
struction can often yield models that are physically equivalent to those that 
are constructed through these other means. 

How general, then, is the free-fermionic construction? It turns out that 
for ten-dimensional string models, this construction is completely general. 
What this means is that all known physically consistent superstring and 
heterotic string models in ten dimensions can be realized via this construc- 
tion (i.e., as stemming from an underlying set of free-fermionic parame- 
ters {V;, ki; }). In lower dimensions, by contrast, this construction is not 
completely general — there exist self-consistent lower-dimensional string 
models which cannot be written or constructed in this manner. However, 
the free-fermionic construction does comprise a vast set of semi-realistic 
string models. Moreover, the free-fermionic construction has the great ad- 
vantage that the rules for construction are relatively simple, and that they 
enable one to systematically construct many string models and examine 
their phenomenological properties. Indeed, many computer programs have 
been written that use this formalism in order to scan the space of string 
models and analyze their low-energy phenomenologies. Thus, for these rea- 
sons, the free-fermionic construction has played a very useful role as the 
underlying method through which the majority of string model-building 
has historically been pursued. 
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6.7.5. Assessment: 


At this point, it is perhaps useful to assess the position in which we now 
find ourselves. Clearly, through these constructions, we are able to produce 
many string models. In fact, as we shall see, the number of self-consistent 
string models in D < 10 is virtually infinite, and there exists a whole space 
of such models. This space of models is called a moduli space, where the 
so-called moduli are various continuous parameters which can be adjusted 
in order to yield different models. (Of course, we have seen that we have 
only discrete parameter choices in ten dimensions, but these parameters can 
become continuous in lower dimensions.) Moreover, each of these models 
has a completely different spacetime phenomenology. What, then, is the 
use of string theory as an “ultimate” theory, if it does not lead to a single, 
unique model with a unique low-energy phenomenology? 

To answer this question, we should recall our discussion at the begin- 
ning of these lectures. Just as field theory is a language for building certain 
models (one of which, say, is the Standard Model), string theory is a new 
and deeper language by which we might also build models. The advantages 
of using this new language, as discussed in Lecture #1, include the fact that 
our resulting models incorporate quantum gravity and Planck-scale physics. 
Of course, in field theory, many parameters enter into the choice of model- 
building. These parameters include the choice of fields (for example, the 
choice of the gauge group, and whether or not to have spacetime supersym- 
metry), the number of fields (for example, the number of generations), the 
masses of particles, their mixing angles, and so forth. These are all space- 
time parameters. In string theory, by contrast, we do not choose these 
spacetime parameters; we instead choose a set of worldsheet parameters. 
For example, in the free-fermionic construction, we choose the parameters 
{Vi, kij}. All of the phenomenological properties in spacetime are then 
derived as consequences of these more fundamental choices. But still, just 
as in field theory, we are faced with the difficult task of model-building. 

Is this progress, then? While opinions on this question may differ, one 
can argue that the answer is still definitely “yes”. Recall that quantum 
gravity is automatically included in these string models. This is one of the 
benefits of model-building on the worldsheet rather than in spacetime. Also 
recall that string theory is a finite theory, and does not contain the sorts 
of ultraviolet divergences that plague us in field theory. This is another 
benefit of worldsheet, rather than spacetime, model-building. Moreover, 
worldsheet model-building ultimately involves choosing fewer parameters 
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than we would have to choose in field theory — for example, we have seen 
that an entire infinite tower of string states, their gauge groups and charges 
and spins, are all ultimately encoded in a few underlying worldsheet param- 
eters such as {V;, kij}. Furthermore, because of this drastic reduction in 
the number of free parameters, string phenomenology is in many ways more 
tightly constrained than ordinary field-theoretic phenomenology. Thus, it 
is in this way that string theory can guide our choices and expectations 
for physics beyond the Standard Model. Indeed, from a string perspective, 
we see that we should favor only those patterns of spacetime physics that 
can ultimately be derived from an underlying set of worldsheet parameters 
such as {V;, kij}. These would then serve as a “minimal set” of parameters 
which would govern all of spacetime physics! 

Of course, at a theoretical or philosophical level, this state of affairs 
is still somewhat unsatisfactory. After all, we still do not know which 
self-consistent choice of string parameters ultimately corresponds to real- 
ity. However, in principle, string theory should be able to predict this 
dynamically. Indeed, even though there exists a whole moduli space of self- 
consistent string models, there should exist an energy or potential func- 
tion in this space (i.e., some function V({$}) of all the moduli {¢}) which 
should dynamically select a particular point in moduli space (e.g., as a local 
or global minimum of V). This would then fix all of the moduli to specific 
values, or equivalently (in the language of the free-fermionic construction) 
tell us which choices of parameters {V;, kij} are preferred dynamically. 

Unfortunately, we do not understand the dynamics of string theory well 
enough to carry out such an ambitious undertaking. Certainly, at the level 
of perturbative (weakly coupled) string theory, we have no way to distin- 
guish amongst the possible low-energy models by calculating such a function 
V({¢}). This is particularly true for string models exhibiting spacetime su- 
persymmetry, for which V = 0 exactly to all orders in perturbation theory. 
Even if the spacetime supersymmetry is broken, the resulting potential 
V({¢}) often turns out not to have a stable minimum. This is the so-called 
“runaway problem”, to be discussed further in Lecture #8. Of course, 
one might hope that recent advances in understanding the non-perturbative 
structure of string theory will ultimately be able to provide guidance in 
this direction. However, as we shall discuss briefly in Lecture #8, although 
these non-perturbative insights (particularly those concerning string dual- 
ity) have thus far changed our understanding of the size and shape of this 
moduli space, they have not yet succeeded in leading us to an explanation 
of which points in this moduli space are dynamically selected. 
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So where do we stand? As string phenomenologists, we can do two 
things. First, we can pursue model-building: we can search through the 
moduli space of self-consistent string models in order to determine how close 
to realistic spacetime physics we can come. This is, in some sense, a direct 
test of string theory as a phenomenological theory of physics. Of course, this 
approach to string phenomenology is ultimately limited by many factors: we 
have no assurance that our model-construction techniques are sufficiently 
powerful or general to include the “correct” string model (assuming that 
one exists); we have no assurance that our model-construction techniques 
will not lead to physically distinct models which nevertheless “agree” as 
far as their testable low-energy predictions are concerned; and we have no 
assurance that the most important phenomenological features that describe 
our low-energy world (such as the pattern of supersymmetry-breaking) are 
to be found in perturbative string theory rather than in non-perturbative 
string theory. For example, it may well be (and it has indeed been ar- 
gued) that the true underlying string theory that describes nature is one 
which is intrinsically non-perturbative, and which would therefore be be- 
yond the reach of the sorts of approaches typically followed in studies of 
string phenomenology. 

Another option, then, is to temporarily abandon string model-building 
somewhat, and to seek to extract general phenomenological theorems or 
correlations about spacetime physics that follow directly from the general 
structure of string theory itself. Clearly, we would wish such information to 
be model-independent, i.e., independent of our particular location in mod- 
uli space or the values of particular string parameters such as {V;, kij}. For 
example, if some particular configuration of spacetime physics (some pat- 
tern of low-energy phenomenology) can be shown to be inconsistent with 
being realized from an underlying set of {V;, kij } parameters, and if such a 
demonstration can be made to transcend the particular free-fermionic con- 
struction so that it relies on only the primordial string symmetries them- 
selves, then such patterns of phenomenology can be ruled out. In this way, 
one can still use string theory in order to narrow the list of possibilities 
for physics at higher energies, and to correlate various seemingly discon- 
nected phenomenological features with each other. Such correlations would 
then be viewed as “predictions” from string theory, and we shall see many 
examples of this phenomenon in subsequent lectures. 

In summary, then, we have seen that there exist powerful ways of con- 
structing string models and surveying their low-energy phenomenologies, 
but that this leads to the problem of selecting the true model (i.e., the true 
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“ground state” or “vacuum”) of string theory. Despite recent advances in 
understanding various non-perturbative aspects of string theory, our in- 
ability to answer the fundamental question of vacuum selection persists. 
Until this challenge is overcome, string phenomenology therefore must con- 
tent itself with answering questions of a relative nature (such as questions 
concerning relative patterns of phenomenology) rather than the sorts of 
absolute questions (such as calculating the mass of the electron) that one 
would also ideally like to ask. Nevertheless, as we shall see, string the- 
ory can still provide us with considerable guidance for physics beyond the 
Standard Model. 


6.8. Lecture #8: A final lecture 


Up to this point, we have primarily discussed string model-building — i.e., 
the art of building string models. Hopefully, we have given the reader some 
sense of the complexity of the many constraints that are involved. In this 
final lecture, however, we shall depart from the somewhat “linear” devel- 
opment we have followed thus far in order to discuss string phenomenology 
— the study of the low-energy physical attributes of these models. 

In the first part of this final lecture, we shall outline some general prop- 
erties of four-dimensional heterotic string models. Then, we shall contrast 
these with the phenomenological properties of open-string D-brane models. 
Finally, we shall provide general comments concerning string phenomenol- 
ogy as a whole, and conclude with a brief discussion of some new, recent 
directions in string phenomenology. 


6.8.1. General properties of perturbative D = 4 heterotic 
string models 


In previous lectures, we have discussed the construction of perturbative 
heterotic string models. Here, we shall now turn the general low-energy 
properties that emerge from these constructions. 
First, such models all have big gauge groups. For perturbative heterotic 
strings in four dimensions, we find that 
rank(G) < 22. (6.8.1) 
This is the four-dimensional analogue of the observation that the maximum 
rank in 10 dimensions is 16, such as for the SO(32) and Eg x Eg heterotic 


strings. The additional six units of rank emerge from the Kaluza-Klein 
reduction from D = 10 to D = 4. 
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If the string model in question is “realistic”, then typically we can write 
G = Gy x Go (6.8.2) 


where Gi contains SU(3) x SU(2) x U(1)nypercharge- Here G is called the 
“observable-sector” gauge group: e.g., G1 could be SU(3) x SU(2) x U(1), 
SO(6) x SO(4), SU(5), SO(10), Es, etc. By contrast, G2 is called the 
“hidden-sector” gauge group. 

Second, there are typically lots of massless (“observable”) states! These 
can be classified into several categories: 


e Typical representations will carry charges under both G; and G2 
(i.e., transform as non-singlet representations of these groups). In 
general, we will only have spinors, vectors (i.e., fundamentals), and 
adjoints at the massless string level. (This is indeed a theorem: 
the allowed representations are closely tied to something called the 
“affine level” of the gauge group.) These states will typically fall 
into two subsets. First, there may be states that can be identified 
as (MS)SM quarks and leptons. In such cases, all gauge symmetry 
groups under which the SM gauge particles transform as singlets 
are considered to be part of Go, i.e., the hidden sector. Second, 
there can be extra states beyond the (MS)SM. There will typically 
be a lot of such states as well. They may be identified as exotic 
quarks and leptons. They will typically have fractional electric 
charge. This could cause problems (see below). 

e Many gauge-singlet states (i.e., states carrying no gauge charges) 
will also exist in the string model. For example, such states include 
the graviton, antisymmetric tensor, and dilaton ¢. Recall that 
string ~ exp(—(@)). Thus, the dilaton must be stabilized to yield 
a fixed value for the string coupling, and to avoid the so-called 
“dilaton runaway problem” (wherein (¢) — 00, or string — 0). 
The dilaton is just one example of a generic class of Lorentz-singlet 
particles called string “moduli”. The effective potential for such 
models is flat to all orders in perturbation theory. Thus, non- 
perturbative string effects must somehow introduce a potential for 
these fields, t.e., lift the degeneracy of string “ground states” and 
select a string vacuum. But how does this happen? This is a 
major unsolved problem, with lots of ideas in the literature. This 
is critically important for string phenomenology, since the vevs of 
the moduli set the values for gauge couplings, particle masses, and 
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so forth. Without knowing the values of these couplings, the best 
we can look for is string-constrained patterns (textures) in these 
parameters. 


Third, there will be infinite towers of Planck-scale string states! These 
states come in increasingly larger representations of gauge groups, and like- 
wise have higher and higher Lorentz spins. These states are the means by 
which string theory maintains finiteness. They propagate in all string loop 
diagrams, and their contributions cancel the divergences of the massless 
states. They are the result of conformal invariance (really its one-loop 
extension, called “modular invariance” ). 

One interesting fact about these states is that the number of such mas- 
sive states with spacetime mass M grows exponentially: 


gm ~ exp(cM Va’) (6.8.3) 


where c is a fixed positive constant. One of the implications of an exponen- 
tially growing degeneracy is as follows. Let us consider the thermodynamic 
partition function: 


Z = X` gm exp(-M/kT) (6.8.4) 
M 


where T is the temperature and k is Boltzmann’s constant. This gives: 


Z = X exp[M(cvaæ —1/kT)]. (6.8.5) 
M 


Thus, if T is bigger than a critical value 
T, = (kevat , (6.8.6) 


then the thermodynamic partition function diverges! This is the so-called 
“Hagedorn” phenomenon. 

Does this signal a phase transition? Or is there instead a limiting 
(“Hagedorn”) temperature for string theory beyond which one cannot go? 
What happens to a box of strings (i.e., the “universe” ) if we pump in lots 
of energy and try to raise the temperature? 

The answers to these questions are really not known. The current belief 
is that we have a phase transition in which all extra energy gets dumped 
into long string modes. But the nature of this phase transition is generally 
unclear. Indeed, this Hagedorn phenomenon is one of the central hallmarks 
of the the subject of string thermodynamics. As might be imagined, this 
subject is of critical importance for string cosmology and for string-based 
studies of the early universe. 
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Fourth, such heterotic string models will typically give rise to a single 
“pseudo-anomalous” U(1) gauge group! Recall that in field theory, given 
a set of states with U(1) charges Q;, we must have `; Q; = 0 in order to 
cancel axial (triangle) anomalies. In particular, certainly the hypercharge 
U(1) must be anomaly-free. 

However, in (many/most) heterotic string models, gauge groups are 
big and there can be extra U(1) gauge groups. One finds, upon summing 
over massless spectrum, that one of these gauge groups, typically denoted 
U(1)x, has corresponding states with charges Qx such that X Qx # 0. 
Thus, from the field-theory point of view, this U(1)x appears to be 
anomalous! 

In fact, however, this gauge group is not anomalous (since string the- 
ory is always anomaly-free); there are extra contributions to the appar- 
ent anomaly which come from anomalous transformations of the string 
“axion field” (related to the antisymmetric tensor B,,,) which cancel this 
anomaly. This is an intrinsically “stringy” mechanism (called the Green- 
Schwarz mechanism) for cancelling an anomaly. 

Fifth, such models typically give rise to automatic gauge coupling unifi- 
cation (regardless of existence of any GUT symmetry in the string model). 
In fact, the gauge couplings are even unified with the gravitational coupling! 

It is easy to understand why this is the case. Recall that our original 
“untwisted” four-dimensional heterotic string model has a “unified” gauge 
group SO(44), with one gauge gauge coupling whose value is set by the 
dilaton vev. (This is the analogue of SO(32) in ten dimensions.) When 
we break subsequently break the gauge symmetry by introducing twists 
(“orbifolding” ), this does not affect the gauge couplings. They are still all 
set by the same dilaton vev (ultimately because there is only one dilaton 
to which all gauge groups can couple). Thus, gauge coupling unification is 
automatic in heterotic string theory. 

One important question is the scale of the unification. Clearly, by 
dimensional analysis, this can be nothing but the string scale. At tree- 
level, we have already seen in Lecture #1 that the string scale is given by 
Mstring = GstringMPlanck. However, work by Kaplunovsky has shown that 
at one-loop order, and with the usual GUT assumption of gstring © 0.7, this 
result is shifted down to an approximate value, Mstring © 5.27 x 1017 GeV. 
This is generally a problem, since the expected GUT value for the uni- 
fication scale is Maur = 2 x 1016 GeV. How then do we explain this 
factor-of-20 discrepancy between Mstring and Maur? This is currently an 
open question, with many potential solutions. A comprehensive review of 
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this subject can be found in K.R. Dienes, Phys Reports 287 (1997) 447 = 
hep-th/9602045. 

Sixth, such models typically give rise to states with fractional electric 
charge. We already referred to this above. Indeed, extra states beyond the 
MSSM will typically have SU(2) x U(1) quantum numbers which imply non- 
integer values for the electric charge. In fact, one can prove (see theorem by 
A.N. Schellekens) that if the model has a gauge symmetry SU(3) x SU(2) x 
U(1) rather than a GUT, then the string will necessarily give rise to such 
fractionally charged states. This is a result of conformal invariance and 
modular invariance. 

One possible resolution to this problem is that such fractionally charged 
states might be able to confine to form integer-charged states under the 
influence of non-abelian gauge symmetries beyond the SM. However, if 
this is not possible in a given string model, then that model is generally 
considered to be phenomenologically inconsistent. 

General theorems exist which enable one to classify the different types of 
fractional charges one can expect to find in a given string model and which 
can be “confined” away. (We refer the reader to papers by Schellekens; also 
by Dienes, Faraggi, March-Russell.) 

Seventh, it turns out that such string models cannot contain any exact 
global symmetries! For example, in heterotic string theory, baryon- and 
lepton-number conservation, as well as other discrete symmetries, must all 
be parts of local symmetries (gauge symmetries) or be only approximate 
symmetries (i.e., accidental). 

Eighth, such heterotic string models will either exhibit spacetime su- 
persymmetry, or they will be non-supersymmetric. If non-supersymmetric, 
however, they nevertheless have a hidden symmetry called a “misaligned 
supersymmetry” which governs how the bosons and fermions are arranged 
at all mass levels so that finiteness is preserved, even without SUSY. Even 
the supertraces, when evaluated over the entire Fock space of string states, 
continue to vanish. [References include K.R. Dienes, Nucl. Phys. B429 
(1994) 533; K.R. Dienes, M. Moshe, and R.C. Myers, Phys. Rev. Lett. 74 
(1995) 4767.] 

However, it is not known whether such non-supersymmetric strings can 
ever be stable beyond tree level. This is an important open question in 
string theory. However, if such stable non-SUSY strings exist, then this 
could provide a whole new framework for thinking about the gauge hier- 
archy problem, SUSY-breaking, questions of finiteness, the role of effective 
field theories and in particular the massive Planck-scale states, and gauge 
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coupling unification. This may even provide an alternative, “stringy” ap- 
proach towards the hierarchy problem which does not involve either super- 
symmetry or extra spacetime dimensions. |For some speculative ideas along 
this direction, see K.R. Dienes, hep-th/0104274.] 

Ninth, the spacetime string spectrum can exhibit certain dualities. 
Indeed, there are several kinds of duality which, taken together, form 
an interconnected web of relations between different kinds of string 
theories. 


e One kind of duality is called “T-duality”. Consider string #1, com- 

pactified on a circle of radius R, and string #2, compactified on a 
circle of radius Va’/R. It turns out that these strings are indis- 
tinguishable, in the sense that they have exactly the same space- 
time spectrum! What would be considered a momentum state in 
string #1 would be considered a winding-mode state in string #2, 
and vice versa. This is clearly a very “stringy” symmetry! In fact, 
this symmetry transcends the mere tree-level spectrum, and holds 
to all orders. It also applies for all correlation functions, scattering 
amplitudes, both perturbatively and even non-perturbatively. This 
is an exact symmetry of closed string theories. 
One important implication of T-duality is that closed string the- 
ory (unlike point-particle field theory) cannot distinguish between 
large and small compactification radii! An interesting question is 
what this might imply about string cosmology. Likewise, what are 
the implications about our ultimate ability to derive effective field 
theories from the string? 

e There are also other kinds of dualities which exist amongst the 
different string theories. For example, there is a duality called “S- 
duality” which flips the sign of the dilaton and thus relates theories 
at weak coupling to theories at strong coupling! Under such a 
mapping, perturbative string states (such as the ones we have been 
considering all along) are exchanged with non-perturbative string 
states (which have not considered at all, but which are “solitons” 
= D-branes in the theory). Under this mapping, for example, the 
SO(32) heterotic (closed) string theory is mapped into the SO(32) 
Type I (open) string theory. Combined with T-duality, one finds 
that all the different kinds of ten-dimensional strings are ultimately 
related to each other, becoming part of a larger superstructure 
called “M-theory”. 
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The study of string dualities is a vast subject which easily deserves 
its own lectures, and which comprises the so-called “second” superstring 
revolution, dating from 1995. Indeed, insights have enabled us, in many 
cases, to “solve” for the strong-behavior of string theory! 

I cannot give a proper introduction to M-theory here, but I will simply 
give some general comments. M-theory is a conjectured eleven-dimensional 
theory (of strings? of membranes? — we don’t know) which can be defined 
through its three fundamental properties: 


e The low-energy limit of M-theory is eleven-dimensional SUGRA 
(recall that D = 11 is the maximum dimension for SUGRA). 

e Compactifying M-theory on a circle of radius R yields the Type IIA 
string with a coupling that is a growing function of R. So, at strong 
coupling, the Type IIA string begins to “see” an extra dimension 
and become eleven-dimensional. 

e Compactifying M-theory on a line segment of length L yields the 
Es x Eg heterotic string with a coupling that grows with L. This is 
why one does not see this 11th dimension in studies of the pertur- 
bative heterotic string: the very act of taking the string coupling 
to be small reduces the 11th dimension to zero size! 


Studying M-theory and its compactifications (and its phenomenological 
properties, such as how SUSY-breaking may be realized in this framework) 
has been a hot topic in the string literature. In particular, one may ask 
whether it is possible to compactify M-theory to four dimensions in ways 
that do not pass through an intermediate realization in terms of a D = 10 
heterotic string, thereby constructing new classes of four-dimensional string 
models? The answer is to this question is ‘yes’. Thus, even without knowing 
the precise nature of M-theory, it has already been possible to use insights 
gleaned from the mere existence of such a theory in order to generate new 
classes of string models. 

Taken together, these developments have led to the realization that 
many of our cherished “fundamental” string symmetries (such as confor- 
mal invariance, modular invariance, etc.) are only effective weak-coupling 
symmetries, applicable only for closed strings. Thus, as the string coupling 
grows in closed string theories, we expect to see deviations from the con- 
straints that come from these symmetries. This could be very useful in 
“freeing up” certain undesirable predictions of string phenomenology, even 
within closed strings. 
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6.8.2. General properties of D = 4 open-string models 


Many of the above phenomenological features of heterotic strings change 
when one deals with Type I theories (i.e., theories which include open 
strings). Some of the most viable models in this class that have chiral 
spectra include so-called “intersecting D-brane Models” as well as models 
with D-branes at singularities. Unfortunately, we do not have the space here 
to discuss such constructions. However, there are excellent reviews avail- 
able, In particular, we refer the reader to R. Blumenhagen, M. Cvetic, P. 
Langacker, and G. Shiu, hep-th/0502005 and to M. Grana, hep-th/0509003. 

There are many reasons to examine such Type I theories. Of course, 
they are interesting in their own right since they are among the possible 
allowed string constructions. However, as a result of the various string 
dualities discussed above, such strings often represent the strong-coupling 
limits of the heterotic models (this is “heterotic/Type I duality”, a com- 
ponent of S-duality). Thus, by studying Type I string models, one is of 
ten really analyzing the strong-coupling limit of a closed heterotic string 
model. 

We cannot provide a complete discussion of such Type I models here. 
However, the basic ideas are simple. Unlike heterotic string models, which 
realize their gauge symmetries along the closed strings through Kaluza- 
Klein reductions from 10 or 26 dimensions (as discussed above), gauge 
symmetries are realized in open strings through so-called “Chan-Paton fac- 
tors” which reside at the endpoints of the open strings. These are the 
analogues of “quarks” at the ends of the open strings, and they carry the 
gauge charges associated with the string states. 

Nowadays these Chan-Paton factors are reinterpreted as the labels asso- 
ciated with D-branes, so that open strings are considered to have endpoints 
which are restricted to lie on D-branes. Indeed, one definition of a D-brane 
is that it represents a solitonic membrane-like object on which an open 
strings can end. A single D-brane corresponds to a U(1) gauge symmetry 
(the corresponding photon being represented by an open string which starts 
and ends on the brane), while non-abelian U (N) gauge symmetries are re- 
alized through stacks of N coincident D-branes. In such configurations, 
the non-abelian gauge bosons are realized as strings which start and end 
on different D-branes within the stack. The Higgs mechanism (by which 
certain gauge symmetries can be broken and certain corresponding gauge 
bosons get heavy) can be realized in this framework by separating branes 
within the stack; those strings which start and end on different D-branes 
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get stretched as a result of this separation, and thus become massive as a 
result of the tension involved in that stretching. 

In general, within such constructions, one might realize the Standard 
Model through an SU(3) stack of D-branes and an SU (2) stack of D-branes. 
In such a scenario, quarks (which carrying non-trivial SU (3) and U(2) gauge 
charges) would be represented by strings stretching from the SU(3) stack 
to the SU(2) stack; such states can indeed be light (or massless) if these 
stacks of branes intersect, and the strings lie near that intersection. Of 
course, gravitational physics continues to be represented by closed strings 
which, having no endpoints, are not tied to particular branes and can there- 
fore propagate freely in the “bulk”. In general, only those states which are 
neutral with respect to all gauge symmetries (such as gravitons) are per- 
mitted to wander freely in the entire volume both within and transverse 
to the branes. In certain constructions, other possible closed-string states 
might include right-handed neutrinos (which are also completely neutral 
with respect to all Standard-Model gauge symmetries). 

In general, the requirements of spacetime supersymmetry imply that the 
theory contain combinations of D-branes of only certain dimensionalities; 
likewise, the relative positions and/or geometric intersections of these D- 
branes are highly constrained. There are also generally other extended 
objects in these theories (beyond D-branes): these include anti-Dbranes, 
orientifold planes, and other types of branes (such as NS branes). Anomaly 
cancellation considerations end up playing a huge role in determining which 
configurations of all of these objects are required to form self-consistent 
string models. 

Other than these constraints, however, one has tremendous freedom in 
designing D-brane configurations, compactifying the theory, wrapping the 
D-branes around the compactification manifolds, and so forth. This is then 
the art of Type I model building. Because of the tremendous range of al- 
lowed D-brane configurations and dimensionalities, and because the closed- 
string and open-string sectors have very different properties, Type I string 
phenomenology turns out to be very rich and unconstrained compared to 
heterotic string phenomenology. 

In particular, even without providing details concerning such construc- 
tions, it is possible to summarize some of the major phenomenological dif- 
ferences between these string models and the closed (heterotic) strings dis- 
cussed above. 

First, the rank of the gauge group no longer restricted to 22! Indeed, 
non-perturbative effecs can give rise to new gauge interactions that can 
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increase the total rank beyond 22, and there is no bound to how large 
these gauge groups can become! (You can decide for yourself whether you 
consider this to be a good thing...) 

Second, the fundamental scale of the theory (Mstring) is no longer tied 
to Mpjanck- The usual heterotic relation Mstring = 9string MPlanck no longer 
applies to open strings. The reason is that for closed strings, both gauge 
forces and the gravitational force emerge together. However, for Type I 
strings, the gravitational force emerges from the closed-string sector, while 
the gauge forces typically emerge from the open-string sectors. This differ- 
ence introduces an undetermined “rescaling” factor between the different 
sectors, and therefore allows one to “dial” Mstring as we wish in such the- 
ories. [For more details, see Chapter 10 of K.R. Dienes, Phys Reports 287 
(1997) 447 = hep-th/9602045, which summarizes the original proposal of 
Witten: E. Witten, Nucl Phys B 471, 135 (1996)]. One could conceivably 
dial the Type I string scale all the way down to the TeV range — see, e.g., 
J. Lykken, PRD 54, 3693 (1996); K.R. Dienes, E. Dudas, T. Gherghetta, 
Nucl. Phys. B537, 47 (1999); G. Shiu, S.-H.H. Tye, Nucl. Phys. B548, 180 
(1999). 

This freedom to adjust the string scale and realize the Standard Model 
as an open string living on a brane while gravitational fields correspond to 
closed strings living in the bulk is the primary reason why Type I strings 
provide the natural realization (and inspiration) for extra-dimensional 
“brane-world” scenarios. 

Third, for weakly coupled heterotic strings, there is only one dilaton-like 
field which couples to all gauge groups and matter fields in a universal way. 
However, in Type I theories there can generally be multiple dilaton-like 
fields. 

Fourth, for Type I theories, gauge coupling unification is no longer auto- 
matic. This is a consequence of the existence of multiple dilaton-like fields. 
Each gauge coupling can be determined by the vev of a different dilaton 
field, and likewise the gauge theories living on different D-branes can ex- 
perience different transverse volumes which also affect the values of their 
respective gauge couplings. 

Fifth, in heterotic strings, there was only one anomalous U(1) because 
there was only one dilaton to cancel this anomaly through the Green- 
Scwharz mechanism. However, in Type I theories there can be multiple 
anomalous U(1)’s because the presence of multiple dilatons in Type I the- 
ories implies that there can be a generalized Green-Schwarz mechanism 
which cancels multiple U(1) anomalies. 
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Sixth, it turns out that whole new types of spacetime compactifica- 
tions are possible. In heterotic strings, one must compactify on a so-called 
“Calabi-Yau” manifold if one wishes to preserve N=1 spacetime supersym- 
metry. (See Polchinski’s textbook for a complete discussion: technically CY 
manifolds are six-dimensional complex manifolds with SU(3) holonomy or 
equivalently vanishing first Chern class.) While the simple cases of tori 
(and orbifolds thereof) are well understood, the general full class of CY 
manifolds is not well understood (not even classified by mathematicians) 
and it is hard to perform detailed calculations of the resulting low-energy 
phenomenologies that emerge when heterotic strings are compactified on 
such spaces. 

Type I string models are different. Because the matter arises locally (on 
branes) rather than globally (in the bulk), the compactification geometry is 
less constrained. For example, chirality no longer requires compactification 
on an orbifold, since chirality can instead emerge directly from D-brane 
intersections even when the compactification space is a smooth manifold. 

For further discussions of these differences between the phenomenolo- 
gies of open and closed strings, good references are: L.E. Ibanez, hep- 
th/9804236; F. Quevedo, Trieste String School Lectures, March 2002. 


6.8.3. String model-building and string phenomenology: 
General practice and goals 


Having discussed the different types of phenomenological features of these 
different types of string models, we now outline the basic way in which 
the string model-building game is played. Of course, the following steps 
are merely caricatures, with many details omitted. Nevertheless, they do 
indicate the rough methodology that a string phenomenologist must follow 
in order to claim to have a realistic string model. 

The first step, as always, is to build the candidate string model itself. 
We have discussed how to do this in great detail in previous lectures. This 
is the string “model-building” aspect of string phenomenology. How one 
goes about doing this will depend on the particular string framework one 
has in mind, whether closed or open strings are involved, whether one is 
dealing with perturbative or non-perturbative constructions, and so forth. 
Each construction will carry with it its own constraints, its own techniques, 
and its own unique advantages and difficulties. 

Once one has a particular string model in hand, one then extracts the 
gauge symmetry, the particle content (massless spectrum only, if one cares 
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only about questions pertaining to observable low-energy states), and all 
associated charges and couplings. 

The next step, if necessary (such as in heterotic strings), is to do a so- 
called “string vacuum shift”. This is a technical step. Recall that there 
often exists a pseudo-anomalous U(1)x gauge symmetry. Although this 
is not really anomalous, it leads to an effective Fayet-Iliopoulos D-term 
which can break spacetime SUSY and destabilize the string vacuum. So, in 
order to “fix” this problem, one shifts the ground state slightly: one assigns 
a vev to certain moduli in the theory in order to break the U(1)x gauge 
symmetry and cancel the D-term. This makes the model stable again. This 
vev may often also break other gauge symmetries in addition to U(1)x. It 
also can generate intermediate mass scales for various light states in the 
string model. 

The third step is to write down an effective Lagrangian of these light 
fields that are derived from the string. Typically we will write something 
of the form 


L = LSUGRA g Lmatter + L couplings : (6.8.7) 
These different pieces are as follows. 


e Lsucra: This must be appropriate for the given model in question, 
e.g., N = 0 (non-susy), or N = 1, or Type IIA or IIB SUGRA, etc. 

e Lyatter: This will consist of the kinetic terms for all light fields 
(including an appropriate dilaton dependence). 

e Leouplings: Here, we must include all couplings allowed by string 
symmetries, given the charges that these states have under both ob- 
servable and hidden-sector gauge symmetries (i.e., selection rules). 
These will include renormalizable and non-renormalizable cou- 
plings, where the non-renormalizable ones are suppressed by powers 
of the string scale Mgtring. In principle, one should calculate the 
coefficients that pre-multiply these terms by explicitly evaluating 
the appropriate corresponding string diagrams. 


All together, this is the “effective Lagrangian from the string model”. 
One must make sure that it is consistent with all string symmetries (e.g., 
T-duality, S-duality, others) if you are going to ask physics questions for 
which those symmetries are likely to be important. 

The final step is to proceed to analyze the physics of the string model by 
analyzing the effective field theory of the effective Lagrangian derived from 
the string. We treat this Lagrangian as describing the physics at the string 
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scale, and use RGE’s to pass to lower energy scales (as we would in ordi- 
nary field theory). Along the way (i.e., at intermediate scales), various new 
features can arise. For example, although Standard Model gauge groups 
will hopefully stay perturbative, the hidden-sector gauge couplings may, 
depending on the particle content, become strong and non-perturbative 
at some intermediate scale. This can trigger the corresponding gauginos 
to condense (“gaugino condensation”). which in turn can trigger SUSY- 
breaking. This is indeed an elegant string-inspired but field-theoretic means 
of breaking SUSY at intermediate energy scales. Likewise, extra matter 
beyond the MSSM (with masses determined by vacuum shifting, as dis- 
cussed) can decouple. Clearly, the analysis for this sep is generally very 
model-dependent! 

Ultimately, we seek to reproduce the low-energy world at the TeV-scale 
— i.e., we wish to reproduce the Standard Model, and then study the phe- 
nomenological implications of the extra string-inspired particles or interac- 
tions that are predicted at higher scales. For example, one might construct 
string GUT models (realizing standard field-theory GUT scenarios from 
string theory), or realize the Standard Model directly at the string scale 
without an intervening GUT, or... 

Given this procedure as outlined, one might wonder what the goals of 
string phenomenology ultimately are. Is it sufficient to try to construct 
semi-realistic string models, or are there are other goals as well? While a 
conversation on this topic can easily yield as many opinions as there are 
people in the conversation, the following represent the personal opinions 
of your humble lecturer. Therefore, the reader is forewarned about the 
potential bias of the lecturer. 

Clearly, one important and undeniable goal must be to try to construct 
realistic string models, i.e., to see how far one can really “push” the em- 
bedding of the low-energy world into string theory, to test the extent to 
which one can really make string theory consistent with the real world. 

Unfortunately, this is very model-dependent. Also, given the large (in- 
finite) “moduli” space of all possible string models, it is hard (impossible?) 
to believe that we would really be lucky enough to stumble across the right 
string model (assuming one exists). 

Also, although we discussed one particular method of model- 
construction in these lectures (the so-called “free-fermionic construction” 
for closed strings), its applications and scope are limited (it essentially only 
hits discrete points in moduli space). They are points of enhanced symme- 
try, so they may indeed be special, but we don’t know the structure of mod- 
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uli space well enough to have a feeling for whether other, more compelling 
points might exist. And for open strings, we have seen the possibilities are 
even more varied! 

Therefore, an alternative goal might be to try to uncover model- 
independent phenomenological truths from string theory. For example, one 
might ask questions such as 


e What “patterns” of low-energy phenomenology are consistent with 
coming from or being realized from an underlying string theory? 

e What “patterns” of low-energy phenomenology can be excluded? 

e What sorts of “correlations” does string theory predict between 
phenomenological features that would otherwise appear to be com- 
pletely independent from a field theory point of view? As an exam- 
ple, string theory predicts correlations between gauge groups and 
fractional charges, etc. These correlations are ultimately the reflec- 
tions of the deeper string symmetries (i.e., worldsheet symmetries) 
from which all spacetime physics is ultimately derived. For further 
editorializing along these lines, see the comments about the string 
landscape at the end of this lecture. In this way, we can then ask 
the question: 

e What guidance does string theory provide for answering or address- 
ing questions of physics beyond the Standard Model? 


Of course, string theory also has the potential to provide insights of a 
completely different nature. For example, just as field theory provides cer- 
tain mechanisms for addressing long-standing questions of particle physics, 
string theory (viewed as a general theory of extended objects) has the po- 
tential to provide new, additional, intrinsically geometric mechanisms for 
solving some of these same problems. Moreover, these mechanisms may 
also be able to generate new approaches to solving long-statnding problems 
that ordinary field theories based on point particles cannot reach. Thus, 
string phenomenology may be able to enlarge the domain of problems that 
a particle physicist might hope to address, and provide new tools for this 
endeavor. 

But finally, perhaps the most important unresolved problem within 
string phenomenology is to understand what selects the string vacuum. 
Clearly, in order to make full progress in our understanding of string the- 
ory and its low-energy phenomenological predictions, we must eventually 
uncover the dynamics (presumably non-perturbative or semi-perturbative, 
or involving a mix of perturbative and non-perturbative physics) which 
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ultimately pushes the universe towards the true ground state of string the- 
ory, the one in which we live. 

Progress along these lines will be very hard, but is very important. Per- 
haps insight will come (or even may be coming) from recent developments 
in string duality. This problem seems tied up with the whole issue of how 
SUSY is broken, and the cosmological constant problem, so it is likely to 
take some time. 


6.8.4. New/current directions in string phenomenology 


We close this final lecture with a brief discussion of three new/current 
directions in string phenomenology. Once again, the following list is hardly 
complete. However, it does capture several of the main thrusts of string 
phenomenology research over the past few years, and the directions which 
are likely to hold the attention of string phenonenologists for the forseeable 
future. 


Large-radius compactifications / TeV-scale strings 


Many of you probably consider higher-dimensional “brane world” sce- 
narios as something separate from string theory. But in truth, much of 
this work is really a branch of string phenomenology: one is studying the 
properties of string theories in a corner of the parameter space where the 
compactification radii are large, or where the Standard Model is restricted 
to a brane (stack) as in Type I models! Indeed, the whole setup of much of 
this work (SM restricted to a brane, gravity propagating in the bulk, and so 
forth) really emerges from Type I string theories where the SM is realized 
through open strings (whose endpoints therefore must lie on D-branes) and 
the graviton is realized through closed strings (which have no endpoints and 
which are therefore free to wander throughout the full higher-dimensional 
spacetime). 

Thus, when one studies issues of flavor physics or develops new higher- 
dimensional mechanisms for understanding hierarchies, supersymmetry 
breaking, proton stability, etc, one is really developing an understanding 
of the phenomenology of open strings in a particular corner of compacti- 
fication parameter space! In other words, the brane world is nothing but 
a branch of string phenomenology, studied through an effective field the- 
ory approach which might ultimately emerge from an underlying (Type I) 
string. 

In addition to Kaustubh Agashe’s excellent lectures on this subject at 
this year’s TASI school, I recommend (of course) my own TASI 2002 lectures 
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on the brane world: K.R. Dienes, 2002 TASI Lectures: New Directions for 
New Dimensions: An Introduction to Kaluza-Klein Theory, Large Extra 
Dimensions, and the Brane World”, available at 
http://scipp.ucsc.edu/haber/tasi_proceedings/dienes.ps. 


Flux compactifications 


Another line of intense research in recent years concerns the possibility 
of so-called flux compactifications. There are compactifications in which 
various background fluxes associated with different p-form gauge fields in 
the theory are actually turned on. (Previous work had always assumed 
that such fluxes were zero.) It turns out that turning on such fluxes has a 
number of important effects. For example, the constraints on the allowed 
compactification geometries are modified, and the extra flux contributions 
allow us to go beyond the simple class of Calabi- Yau compactifications. 

However, the most important phenomenological aspect of such flux com- 
pactifications is that they provide a framework leading to new methods 
of moduli stabilization. Indeed, within the framework of flux compacti- 
fications, it has been been possible to build semi-realistic string models 
in which the vast majority of complex and Kahler moduli are completely 
frozen! 

Flux compactifications thus provide a new arena in which to address the 
all-important issues of moduli stabilization and vacuum selection. Indeed, 
work of Kachru, Kallosh, Linde, and Trivedi (KKLT) has even provided a 
framework in which it might be possible to realize meta-stable string vacua 
with deSitter (dS) geometries. This is of critical importance if string theory 
is to make contact with cosmological evolution. 


The string theory “landscape” 


Finally, as we have seen repeatedly throughout these lectures, one of 
the most serious problems faced by practitioners of string phenomenology 
is the multitude of possible, self-consistent string vacua. That there exist 
large numbers of potential string solutions has been known since the ear- 
liest days of string theory; these result from the large numbers of possible 
ways in which one may choose an appropriate compactification manifold (or 
orbifold), an appropriate set of background fields and fluxes, and appropri- 
ate expectation values for the plethora of additional moduli to which string 
theories generically give rise. Although historically these string solutions 
were not completely stabilized, it was tacitly anticipated for many years 
that some unknown vacuum stabilization mechanism would ultimately lead 
to a unique vacuum state. Unfortunately, recent developments suggest 


376 Keith R. Dienes 


that there continue to exist huge numbers of self-consistent string solutions 
(i.e., string “models” or “vacua”) even after stabilization. Thus, a picture 
emerges in which there exist huge numbers of possible string vacua, all po- 
tentially stable (or sufficiently metastable), with apparently no dynamical 
principle to select amongst them. Indeed, each of these potential vacua can 
be viewed as sitting at the local minimum of a complex terrain of possible 
string solutions dominated by hills and valleys. This terrain has come to 
be known as the “string-theory landscape”. 

The existence of such a landscape has tremendous practical significance 
because, as we have seen, the specific low-energy phenomenology that can 
be expected to emerge from string theory depends critically on the partic- 
ular choice of vacuum state. Detailed quantities such as particle masses 
and mixings, and even more general quantities and structures such as the 
choice of gauge group, number of chiral particle generations, magnitude of 
the supersymmetry-breaking scale, and even the cosmological constant can 
be expected to vary significantly from one vacuum solution to the next. 
Thus, in the absence of some sort of vacuum selection principle, it is nat- 
ural to tackle a secondary but perhaps more tractible question concerning 
whether there might exist generic string-derived correlations between dif- 
ferent phenomenological features. In this way, one can still hope to extract 
phenomenological predictions from string theory. 

Over the past two years, this idea has triggered a surge of activity con- 
cerning the statistical properties of the landscape. Investigations along 
these lines have focused on diverse phenomenological issues including the 
value of the supersymmetry-breaking scale, the value of the cosmological 
constant, and the preferred rank of the corresponding gauge groups, the 
prevalence of the Standard-Model gauge group, and possible numbers of 
chiral generations. Discussions of the landscape have also led to various 
theoretical paradigm shifts, ranging from alternative landscape-based no- 
tions of naturalness and novel cosmological inflationary scenarios to the use 
of anthropic arguments to constrain the set of viable string vacua. There 
have even been proposals for field-theoretic analogues of the string-theory 
landscape, as well as discussions concerning whether a landscape of suffi- 
ciently stable string vacua actually exists. 

The implications of a landscape (if it exists) have been hotly debated 
in the string community. Undoubtedly, if the string landscape exists, it 
is a very rich place, full of unanticipated properties and characteristics. 
Nevertheless, at the very least, the possible existence of such a landscape has 
focused the attention of the string community on the fundamental question 
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which has plagued string theory over the past twenty years, namely the 
issue of vacuum selection. 

One might argue that the landscape is simply too large to permit any 
reasonable analysis. Indeed, one might even argue that if such a landscape 
exists, string theory is doomed as a predictive theory of physics, and that 
the answers to some of the most fundamental questions in physics might 
find their answers in random environmental selection (or as the result of 
cosmological chance). 

However, it is also true that the direct examination of actual string mod- 
els uncovers features and behaviors that might not otherwise be expected. 
Moreover, through direct enumeration, we gain valuable experience in the 
construction and analysis of phenomenologically viable string vacua. Fi- 
nally, as string phenomenologists, we must ultimately come to terms with 
the landscape (if it exists). Just as in other fields ranging from astrophysics 
and botany all the way to zoology, the first step in the analysis of a large 
data set is enumeration and classification. Indeed, this is how science be- 
gins. Thus, properly interpreted, statistical landscape studies might be 
useful and relevant in this overall endeavor of connecting string theory to 
the real world. 
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Chapter 7 


Theoretical Aspects of Neutrino Masses and Mixings 


R. N. Mohapatra 


Department of Physics, University of Maryland, College Park, MD, 
20742, USA 


Neutrino oscillation experiments have yielded valuable information on 
the nature of neutrino masses and mixings and have provided the first 
glimpse of new physics beyond the standard model. Even though we are 
far from a complete understanding of the new physics implied by them, 
some tell-tale hints are emerging which have narrowed the direction of 
the new physics and have provided some insight into the flavor problem. 
In these lectures, I provide a panoramic overview of the current thinkings 
in neutrino model building. 


7.1. Introduction 


For a long time, it was believed that neutrinos are massless, spin half parti- 
cles, making them drastically different from their other standard model 
spin half cousins such as the charged leptons (e,u,7) and the quarks 
(u, d, s,c,t,b), which are known to have mass. This myth has however been 
shattered by the accumulating evidence for neutrino mass from the solar 
and atmospheric neutrino observations compiled in the nineties as well as 
several terrestrial experiments in the new century. One must therefore now 
be free to look beyond the massless neutrino idea to explore new physics 
as we proceed to understand the neutrino mass. 

The possibility of a nonzero neutrino mass at phenomenological level 
goes back almost 50 years. In the context of gauge theories, they were 
discussed extensively in the 70’s and 80’s long before there was any firm 
evidence for it. For instance the left-right symmetric theories of weak in- 
teractions introduced in 1974 and discussed in those days in connection 
with the structure of neutral current weak interactions, predicted nonzero 
neutrino mass as a necessary consequence of parity invariance and quark 
lepton symmetry. 
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The existence of a nonzero neutrino mass makes neutrinos more like 
the quarks, and allows for mixing between the different neutrino species 
leading to the phenomenon of neutrino oscillation, an idea first discussed 
by Pontecorvo! and Maki et al.’ in the 1960’s, unleasing a whole new realm 
of particle physics phenomena to explore beyond the standard model. At 
the present time, we are of course far from a complete picture of the masses 
and mixings of the various neutrinos and cannot therefore have a full outline 
of the theory of neutrino masses. However there exist enough information 
that we can surmise some viable possibilities for the theories beyond the 
standard model. Combined with other ideas outside the neutrino arena such 
as supersymmetry and unification, the possibility narrows even further. 
Many clever experiments now under way will soon clarify or rule out many 
of the allowed models. In these lectures, I give a panoramic view of what 
we may have learned about physics beyond the standard model and new 
symmetries of flavor as we attempt to understand neutrino masses. 

For simplicity, I will focus on a widely discussed framework for un- 
derstanding of the small neutrino masses, the seesaw mechanism, which 
employs a minimal extension of the standard model by adding two or three 
right handed neutrinos, which are super-heavy and Majorana type. I will 
touch briefly on some specific models that are based on the above general 
framework but attempt to provide an understanding of the detailed mass 
and mixing patterns using family symmetries which must supplement the 
seesaw mechanism. We will also present a class of SO(10) grand unified 
theories where there is no need for family symmetries to understand large 
mixings. These works are instructive for several reasons: first they provide 
an existence proof that this is a sensible way to proceed in tackling the hard 
problem of understanding large lepton mixings; second they often illustrate 
the kind of assumptions needed and through that provide a unique insight 
into which directions the next step should be; finally of course nature may 
be generous in picking one of those models as the final message bearer. 

The fact that the neutrino has no electric charge endows it with certain 
properties not shared by other fermions of the standard model. One can 
write two kinds of Lorentz invariant mass terms for the neutrino, the Dirac 
and Majorana masses, whereas for the charged fermions, conservation of 
electric charge allows only Dirac type mass terms. In the four component 
notation for the fermions, the Dirac mass has the form ww, whereas the 
Majorana mass is of the form ~7C~!w, where w is the four component 
spinor and C is the charge conjugation matrix. One can also discuss the 
two different kinds of mass terms using the two component notation for the 


Theoretical Aspects of Neutrino Masses and Mixings 381 


spinors, which provides a very useful way to discuss neutrino masses. We 
therefore present some of the salient concepts behind the two component 
description of the neutrino. 


7.1.1. Two component notation for neutrinos 


Before we start the discussion of the 2-component neutrino, let us write 
down the Dirac equation for an electron:? 


iw — mp = 0 (7.1) 
This equation follows from a free Lagrangian 
L = iby dw — mb (7.2) 


and leads to the relativistic energy momentum relation p*p, = m? for the 
spin-half particle only if the four y,’s anticommute. If we take ’s to 
be n x n matrices , the smallest value of n for which four anticommuting 
matrices exist is four. Therefore ~ must be a four component spinor. The 
physical meaning of the four components is as follows: two components for 
particle spin up and down and same for the antiparticle. 

A spin-half particle is said to be a Majorana particle if the spinor field 
w satisfies the condition of being self charge conjugate, i.e. 


p= =09", (7.3) 


where C is the charge conjugation matrix and has the property Cy,C7! = 
a", This constraint reduces the number of independent components of 
the spinor by a factor of two, since the particle and the antiparticle are now 
the same particle. Using this condition, the mass term in the Lagrangian 
in Eq. (2) can be written as YTO ty, where we have used the fact that C 
is a unitary matrix. Writing the mass term in this way makes it clear that 
if a field carries a U(1) charge and the theory is invariant under those U(1) 
transformations, then the mass term breaks this symmetry. This means 
that one cannot impose the Majorana condition on a particle that has a 
gauge charge. Since the neutrinos do not have electric charge, they can be 
Majorana particles unlike the quarks, electron or the muon. It is of course 
well known that the gauge boson interactions in a gauge theory Lagrangian 
conserve a global U(1) symmetry known as lepton number with the neutrino 
and electron carrying the same lepton number. If lepton number were to 
be established as an exact symmetry of nature, the Majorana mass for the 
neutrino would be forbidden and the neutrino, like the electron, would be 
a Dirac particle. 


382 R. N. Mohapatra 


The properties of a Majorana fermion can be seen in its free field ex- 
pansion in terms of creation and annihilation operators: 


dp —ip r t ip- x£ 
w= > (as(p)us(p)e P? + alu.(p)e” ). (7.4) 


In the gamma matrix convention where y; = ( 2 a and yo = (5 a 
-ci 


the us and vs are given by 


up) = 5 (285, ) (75) 


and 
(7.6) 


as and a’, are two component spinors. 

If we choose a’, = 0205, we get the relation among the spinors u,(p) 
and vs(p) Cyou*(p) = vs(p) and the Majorana condition follows. Note that 
if w were to describe a Dirac spinor, then we would have had a different 
creation operator b? in the second term in the free field expansion above. 

The origin of the two component neutrino is rooted in the isomorphism 
between the Lorentz group and the SL(2, C) group. The latter is defined as 
the set of 2 x 2 complex matrices with unit determinant, whose generators 
satisfy the same Lie algebra as that of the Lorentz group. Its basic repre- 
sentations are 2 and 2* dimensional. These are the spinor representations 
and can be used to describe spin half particles. 

We can therefore write the familiar 4-component Dirac spinor used in 
the text books to describe an electron can be written as ọ = ee ), 

2 
where x and ¢ two two component spinors. A Dirac mass is the given by 
x? o2 whereas a Majorana mass is given by x? o2y, where oq are the Pauli 
matrices. To make correspondence with the four component notation, we 
point out that ¢@ and io2x* are nothing but the wz and wp respectively. 
It is then clear that x and @ have opposite electric charges; therefore the 
Dirac mass yT 02 maintains electric charge conservation (as well as any 
other kind of charge like lepton number etc.). 

2-component neutrino is described by the following Lagrangian: 


i im ; im _; 
L = viiordyy — ge n ow + ae an f (7.7) 
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This leads to the following equation of motion for the field x 
io*O,x — imozx* = 0. (7.8) 


As is conventionally done in field theories, we can now give a free field 
expansion of the two component Majorana field in terms of the creation 
and annihilation operators: 


x(x, t) = So [ap,sap, se? pg ab eae, (7.9) 


where the sum on s goes over the spin up and down states. 


Exercise 1: Using the field equations for a free massive two component 
Majorana spinor, show that its expansion in terms of the creation and 


1 
annihilation operators and two component spinors a = ( 5) and 3 = o) 


is given by the following expression: 


xe) = J lappe ah etta] E+ p 
p 
+ X fap, -e + al, ,e? =a] Ep. (7.10) 
p 
Note that in a beta decay process, where a neutron is annihilated and 
proton is created, the leptonic weak current that is involved is ev (drop- 
ping gamma matrices); therefore, along with the electron, what is cre- 
ated predominantly is a right handed particle (with a wave function a), 
the amplitude being of order YVE Fp ~ V2E. This is the right handed 
anti-neutrino. The left handed neutrino is produced with a much smaller 
amplitude YVE =p ~ m,/E. Similarly, in the fusion reaction in the core 
of the Sun, what is produced is a left handed state of the neutrino with 
a very tiny i.e. O(m,/E) admixture of the right handed helicity (or the 
“anti-neutrino” component). 


7.1.2. Neutrinoless double beta decay and neutrino 
Majorana mass 


As already noted a Majorana neutrino breaks lepton number by two units. 
This has the experimentally testable prediction that it leads to the process 
of neutrino-less double beta decay, that involves the decay of an even-even 
nucleus i.e. (Z, N) > (Z +2, N —2)+2e7. We will now show by using the 
above property of the Majorana neutrino that if light neutrino exchange 
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is responsible for this process, then the amplitude is proportional to the 
neutrino mass. 

Double beta decay involves the change of two neutrons to two protons 
and therefore has to be a second order weak interaction process. Since each 
weak interaction process emits an antineutrino, in second order weak inter- 
action, the final state will involve two anti-neutrinos. But in neutrino-less 
double beta decay, there are no neutrinos in the final state; therefore the two 
neutrinos must go into the vacuum state. Vacuum state by definition has no 
spin whereas the antineutrino emitted in a beta decay has spin. Consider 
the antineutrino from one of the decays: it must be predominantly right 
handed. But to disappear into vacuum, it must combine with a lefthanded 
antineutrino so that the left and right handed spin projections add up to 
zero. In the previous paragraph, we showed that the fraction of left handed 
spin projection in a neutrino emitted in beta decay is m,/E. Therefore, 
DeDe — |0 > must be proportional to the neutrino mass. Thus neutrino-less 
double beta decay is therefore a very sensitive measure of neutrino mass. 


7.1.3. Neutrino mass in two component notation 


Let us now discuss the general neutrino mass for Majorana neutrinos. We 
saw earlier that for Majorana neutrinos, there are two different ways to 
write a mass term consistent with relativistic invariance. This richness in 
the possibility for neutrino masses also has a down side in the sense that 
in general, there are more parameters describing the masses of the neutri- 
nos than those for the quarks and leptons. For instance for the electron 
and quarks, dynamics (electric charge conservation) reduces the number of 
parameters in their mass matrix. As an example, using the two compo- 
nent notation for all fermions, for the case of two two component spinors, a 
charged fermion mass will be described only by one parameters whereas for 
a neutrino, there will be three parameters. This difference increases rapidly 
e.g. for 2N spinors, to describe charged fermion masses, we need N? pa- 
rameters (ignoring CP violation) whereas for neutrinos, we need aNGNY)) 
parameters. What is more interesting is that for a neutrino like particle, 
one can have both even and odd number of two component objects and 
have a consistent theory. 

In this article, we will use two component notation for neutrinos. Thus 
when we say that there are N neutrinos, we will mean N two-component 
neutrinos. 

In the two component language, all massive neutrinos are Majorana 
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particles and what is conventionally called a Dirac neutrino is really a very 
specific choice of mass parameters for the Majorana neutrino. Let us give 
some examples: If there is only one two component neutrino (we will drop 


the prefix two component henceforth), it can have a mass mv? 


aav (to 
be called = mvv in shorthanded notation). The neutrino is now a self 
conjugate object which can be seen if we write an equivalent 4-component 


spinor w: 


p= EN (7.11) 


Note that this 4-component spinor satisfies the condition 
p = pê = CyT (7.12) 


This condition implies that the neutrino is its own anti-particle, a fact more 
transparent in the four- rather than the two-component notation. The 
above exercise illustrates an important point i.e. given any two component 
spinor, one can always write a self conjugate (or Majorana) 4-component 
spinor. Whether a particle is really its own antiparticle or not is therefore 
determined by its interactions. To see this for the electrons, one may solve 
the following exercise i.e. if we wrote two Majorana spinors using the two 
two-component spinors that describe the electron, then until we turn on the 
electromagnetic interactions and the mass term, we will not know whether 
the electron is its own antiparticle or not. Once we turn on the electro- 
magnetism, this ambiguity is resolved since electric charge conservation will 
allow a mass term that connects the two 2-component spinors and no mass 
term connecting either of the two component spinors with themselves. 

Let us now go one step further and consider two 2-component neutrinos 
(vı, v2). The general mass matrix for this case is given by: 


Məx2 = & > (7.13) 


m3 ms, 


Note first that this is a symmetric matrix and can be diagonalised by orthog- 
onal transformations. The eigen-states which will be certain admixtures of 
the original neutrinos now describe self conjugate particles. One can look 
at some special cases: 
Case 1: 

If we have m1,2 = 0 and m3 Æ 0, then one can assign a charge +1 to 
vı and —1 to vz under some U(1) symmetry other than electromagnetism 
and the theory is invariant under this extra U(1) symmetry which can 
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be identified as the lepton number and the particle is then called a Dirac 
neutrino. The point to be noted is that the Dirac neutrino is a special 
case of for two Majorana neutrinos. In fact instead of calling this a Dirac 
neutrino, we could call this a case with two Majorana neutrinos with equal 
and opposite (in sign) mass. Since a complex mass term in general refers to 
its C transformation property (i.e. Y° = ey, (where ôm is the phase of 
the complex mass term), the two two-component fields of a Dirac neutrino 
having opposite sign mass would be equivalent to having opposite charge 
conjugation properties. 
Case it: 

If we have m1,2 < ms, this case is called pseudo-Dirac neutrino since 
this is a slight departure from case (i). In reality, in this case also the 
neutrinos are Majorana neutrinos with their masses +mg + ô with ô < mo. 


The two component neutrinos will be maximally mixed. Thus this case is 
of great current physical interest in view of the atmospheric (and perhaps 
solar) neutrino data. 

Case tit: 

There is third case where one may have mı = 0 and mg < mə. In 
this case the eigenvalues of the neutrino mass matrix are given respectively 
by: my = ai and M ~ mə. One may wonder under what conditions 
such a situation may arise in a realistic gauge model. It turns out that 
if vı transforms as an SU (2)z doublet and 12 is an SU (2)z singlet, then 
the value of mg is limited by the weak scale whereas mə has no such limit 
and mı = 0 if the theory has no SU (2)z triplet field (as for instance is the 
case in the standard model). Choosing m2 >> m3 then provides a natural 
way to understand the smallness of the neutrino masses. This is known 
as the seesaw mechanism.* Since this case is very different from the case 
(i) and (ii), it is generally said that in grand unified theories, one expects 
the neutrinos to be Majorana particles. The reason is that in most grand 
unified theories there is a higher scale which under appropriate situations 
provides a natural home for the large mass mz. 

While we have so far used only two neutrinos to exemplify the various 
cases including the seesaw mechanism, these discussions generalize when 
mı,2,3 are each N x N matrices (which we denote by m3 = mp and mz = 
Mr). For example, the seesaw formula for this general situation can be 
written as 


M, ~ mi —mpMR mp (7.14) 


where the subscripts D and R are used in anticipation of their origin in 
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gauge theories where Mp turns out to be the Dirac matrix and Mp is the 
mass matrix of the right handed neutrinos and all eigenvalues of Mp are 
much larger than the elements of Mp. It is also worth pointing out that 
Eq. 7.14 can be written in a more general form where the Dirac matrices 
are not necessarily square matrices but N x M matrices with N 4 M. We 
give such examples below. 

Although there is no experimental proof that the neutrino is a Majorana 
particle, the general belief is that since the seesaw mechanism provides such 
a simple way to understand the glaring differences between the masses of 
the neutrinos and the charged fermions, neutrino is indeed most likely to 
be a Majorana particles as implied by it. 

Even though in many situations, the difference between the Dirac and 
Majorana neutrinos is not manifest, there are some physical processes where 
differences becomes explicit: one such process is when the two neutrinos 
annihilate. For Dirac neutrinos, the particle and the antiparticle are dis- 
tinct and therefore their annihilation is not restricted by Pauli principle in 
any manner. However, for the case of Majorana neutrinos, the identity of 
neutrinos and antineutrinos plays an important role and one finds that the 
annihilation to the Z-bosons occurs only via the P-waves. Similarly in the 
decay of the neutrino to any final state, the decay rate for the Majorana 
neutrino is a factor of two higher than for the Dirac neutrino. 


7.1.4. Experimental indications for neutrino masses 


There have been other lectures at this school on the experimental evidences 
for neutrino masses and their analyses to determine the current favorite 
values for the various mass differences as well as mixing angles. I will 
therefore only summarize the main results: (For detailed discussion and 
references, see and lectures by J. Conrad®). 

The evidence for neutrino masses and mixings have come from neutrino 
oscillation experiments involving neutrinos from the Sun, the cosmic rays 
as well as from accelerators. Neutrino oscillation is a phenomenon where 
neutrinos of one flavor transmute to neutrinos of another flavor. Since such 
transmutation can occur only if the neutrinos have masses and mixings, 
these experiments provide evidence for neutrino mass. To see this, note 
the expression for vacuum oscillation probability for neutrinos of a given 
energy E that have travelled a distance L is given by: 


ees A,;L 
Pog = Y leaitits;Uasleos ( T - 0935) (7.15) 
ij 
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This can be derived on the basis of simple quantum mechanical superpo- 
sition principle and the equations of time evolution of free particles. The 
observed neutrino oscillation probabilities therefore yield information about 
the mass difference squares of the neutrinos (Aj; = m? — m?) and mixing 
angles Uai. If the neutrino propagates in dense matter, the oscillation 
probability is changed by the so-called Mikheyev-Smirnov-Wolfenstein ef- 
fect however the new probability depends on the same parameters i.e. mass 
difference square and the Uai in addition to depending on the density of 
matter through which neutrinos travel. The analysis of the data for the 
atmospheric neutrinos where the neutrino propagates in vacuum and that 
for solar neutrinos where the effect of dense matter in the Sun is included 


lead to the following picture values for mass differences and mixings: 


7.1.4.1. Atmospheric neutrinos: 


The Super-Kamiokande experiment observed the oscillations of the atmo- 
spheric muon neutrinos to tau neutrinos (although the tau neutrinos from 
the oscillations have not been confirmed yet). This oscillation of v, to vr 
has now been confirmed by the accelerator observations in the K2K and the 
MINOS experiments. From the existing data several important conclusions 
can be drawn: (i) the data cannot be fit assuming oscillation between v, 
and ve nor Vu — Vs, where vs is a sterile neutrino which does not any direct 
weak interaction; (ii) the oscillation scenario that fit the data best is v,,—v, 
for the mass and mixing parameters 

Am? ~ (2 — 8) x 107? ev’; (7.16) 


ee Vr 


in?20,,_, > 0.92 


7.1.4.2. Solar neutrinos 


The second evidence for neutrino oscillation comes from the several exper- 
iments that have observed a deficit in the flux of neutrinos from the Sun 
as compared to the predictions of the standard solar model championed by 
Bahcall and his collaborators” and more recently studied by many groups. 
The experiments responsible for this discovery are the Chlorine experiment 
of Ray Davis, Kamiokande, Gallex, SAGE, Super-Kamiokande, SNO, GNO 
experiments conducted at the Homestake mine, Kamioka in Japan, Gran 
Sasso in Italy and Baksan in Russia and Sudbery in Canada respectively. 
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The different experiments see different parts of the solar neutrino spectrum. 
The details of these considerations are discussed in other lectures. 

As far as the final state goes, it can either be one of the two remaining 
active neutrinos , vy, and v, or it can be the sterile neutrino vs. SNO 
neutral current data announced recently has very strongly constrained the 
second possibility (i.e. the sterile neutrino in the final state). The global 
analyses of all solar neutrino data seem to favor the so called large mixing 
angle MSW solution with parameters: Am? ~ 1.2 x 1075 — 3.1 x 10~4eV?; 
sin? 20 ~ 0.58 — 0.95. 

This result has been confirmed by the terrestrial KamLand experiment 
which looked for oscillation for reactor neutrinos from several reactors with 
a detector at an average distance of about 100 Km from the various sources. 
It eliminated solutions to the solar neutrino puzzle based e.g. on spin flavor 
precession as well as the so called low solution and confirmed the large angle 
MSW resolution as the most plausible one. 


7.1.4.3. Search for the mixing angle 013 


The remaining mixing angle 013 has been probed by the reactor experiments 
that used the French reactor CHOOZ and the US reactor in Palo-Verde by 
looking for the oscillation of reactor electron anti-neutrinos with a detector 
at a distance of about a kilo-meter. In the simple three neutrino picture, 
the dominant oscillation in this case would be to the tau neutrinos since 
Am3, > Am3,. The absence of a signal put an upper limit on the mixing 
angle of about 013 < 0.17. There are several reactor as well as long base- 
line experiments now being prepared which will conduct a higher precision 
search for 0:3 during the next decade.” 


7.1.4.4. LSND and MiniBooNe 


Finally, we come to the last indication of neutrino oscillation from the Los 
Alamos Liquid Scintillation Detector (LSND)® experiment , where neutrino 
oscillations both from a stopped muon (DAR) as well as the one accompa- 
nying the muon in pion decay (known as the DIF) have been observed. The 
evidence from the DAR is statistically more significant and is an oscillation 
from P, to De. The mass and mixing parameter range that fits data is®: 


LSND: Am? ~ 0.2 — 2eV”; sin? 20 ~ 0.003 — 0.03 (7.17) 


390 R. N. Mohapatra 


There are also points at higher masses specifically at 6 eV? which are also 
allowed by the present LSND data for small mixings. KARMEN experiment 
at the Rutherford laboratory has very strongly constrained the allowed 
parameter range of the LSND data. Recently the Miniboone experiment 
at Fermilab has announced the results of its search for v, — ve oscillation. 
They have not found any evidence for oscillation with characteristic mass 
diffrerence square in the eV range.*? 


7.1.4.5. Neutrino-less double beta decay and Tritium decay 
experiment 


Oscillation experiments only depend on the difference of mass squares of 
the different neutrinos and the mixing angles. Therefore, in order to have 
a complete picture of neutrino masses, we need other experiments. Two 
such experiments are the neutrino-less double beta decay searches and the 
search for neutrino mass from the analysis of the end point of the electron 
energy spectrum in tritium beta decay. They have been discussed at this 
institute by P. Vogel,!! which is referred to for more details and references. 

Neutrino-less double beta decay measures the following combination of 
masses and mixing angles: 


<m >B = X UAm: (7.18) 


Therefore naively speaking it is sensitive to the overall neutrino mass scale. 
But in practice, as we will see below, for the case of both normal and in- 
verted hierarchies, it is unlikely to settle the question of the overall mass 
scale at the presently contemplated level of sensitivity in double beta decay 
searches. Only if the neutrino mass patterns are inverse hierarchical does 
one expect a visible signal in BBoy decay. We do not get into great details 
into this issue except to mention that in drawing any conclusions about 
neutrino mass from this process, one has to first have a good calculation of 
nuclear matrix elements of the various nuclei involves such as “Ge, 18°Xe, 
10°Mo etc.; secondly, another confusing issue has to do with alternative 
physics contributions to G9, which are unrelated to neutrino mass. Nev- 
ertheless, neutrino-less double beta decay is a fundamental experiment and 
a nonzero signal will establish the important result that neutrino is a Ma- 
jorana particle and that lepton number symmetry is violated. Regardless 
of whether it tells us anything about the neutrino masses, it would provide 
an indication in favor of the seesaw mechanism. Presently two experiments 
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Heidelberg-Moscow and IGEX that use enriched 7Ge have published lim- 
its of < 0.3 eV. More recently, evidence for a double beta signal in the 
Heidelberg-Moscow data has been claimed.!* Several experiments are now 
under planning e.g. EXO, Majorana and Cuore etc. which are expected to 
improve the sensitivity to the Majorana mass of the neutrino to the level 
of 20 milli-eV. This can for example test the hypothesis that neutrino mass 
ordering may be inverted if they are Majorana fermions. 

Another important result in further understanding of neutrino mass 
physics could come from the tritium end point searches for neutrino 
masses. This experiment will measure the parameter my = y}; [Ucil m?. 
This involves a different combination of masses and mixing angles than 
< m >gg. Presently, the KATRIN proposal for a high sensitive search for 
for m, has been made and it is expected that it can reach a sensitivity of 
0.2 eV. 

A third source of information on neutrino mass will come from cosmol- 
ogy, where more detailed study of structure in the universe is expected to 
provide an upper limit on 57, m; of less than an eV. Present WMAP data 
appears to provide a limit of >, mp, < 0.6 — 1 eV.!% 

Our goal now is to study the theoretical implications of these discov- 
eries. We will proceed towards this goal in the following manner: we will 
isolate the mass patterns that fit the above data and search for patterns 
and symmetries that lead to observed mixing angles. We then look for 
plausible models that can first lead to the general feature that neutrinos 
have tiny masses; then we would try to understand in simple manner some 
of the features indicated by data in the hope that these general ideas will 
be part of our final understanding of the neutrino masses. As mentioned 
earlier on, to understand the neutrino masses one has to go beyond the 
standard model. First we will sharpen what we mean by this statement. 
Then we will present some ideas which may form the basic framework for 
constructing the detailed models. 


7.2. Physics of neutrino mass 


There are two distinct aspects to neutrino mass physics: first is the absolute 
overall magnitude and second is the flavor structure. Understanding the 
first will reveal the gross features of the new physics such as the presence of 
anew symmetry and its scale responsible for the smallness of neutrino mass 
compared to masses of other fundamental fermions, whereas understanding 
the flavor pattern is likely to throw light on possible new family symmetries 
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of matter which may in turn be relevant to unravelling the mystery of quark 
flavor. It could be (probably likely) that both are related to each other. 

To begin this discussion, we first list the puzzles of neutrino mass 
physics; we then discuss the neutrino mass matrix which is the starting 
point of many attempts to understand the neutrino mixings and then dis- 
cuss ideas that have been proposed to understand these patterns before 
going to a discussion of the overall scale. 


7.2.1. Puzzles of neutrino mass physics 


The present neutrino discoveries have posed a list of puzzles for physics 
beyond the standard model, whose resolution will provide an unmistakable 
path beyond it. Below we give a list of these puzzles. 


e Ultra-light-ness of neutrinos: Why are the neutrino masses so much 
lighter than the quark and charged lepton masses? 

e Bi-large mixing: How to understand simultaneously two large mix- 
ing angles one for the u — T and another for e — u? 

e Smallness of Am?,/Am%: Experimentally, Am?, ~ 1077 Am}. 
How does one understand this in a natural manner? 

e Smallness of Ue3: The reactor results also seem to indicate that the 
angle 613 = U-3 is a very small number. One must also understand 
this in a framework that simultaneously explains all other puzzles. 


Possible other puzzles include a proper understanding of neutrino mass 
degeneracy if there is a large positive signal for the neutrinoless double beta 
decay and of course, when we have evidence for CP violating phases in the 
mass matrix, we must understand their magnitude. 


In order to take the first step towards understanding these puzzles, we 
discuss the flavor pattern of leptons that may be at the root of large mixings 
and defer the discussion of the origin of mass scale to the next section and 
finally focus on specific unification models which address both the issues 
as examples how one may proceed to unravel the grand picture of physics 
beyond standard model inspired by neutrino physics. It could of course be 
that large neutrino mixings result from a joint effect of both the charged 
lepton and the neutrino matrix since Upyng = Uj U, and we do not 
know apriori whether the large neutrino mixings come from the charged 
lepton sector or the neutrino sector or both. However, we first follow the 
line of thinking that in the fundamental theory, charged lepton mass matrix 
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is diagonal or near diagonal and all mixings result from the neutrino mass 
matrix. This point of view is not so unreasonable since charged lepton mass 
matrix is likely to be similar to the quark sector and the small observed 
CKM mixings pretty much guarantees that quark mass matrices are near 
diagonal. We will also present grand unified models where this conjecture 
is borne out. 


7.2.2. Notation 


We will assume two component neutrinos and therefore their masses will in 
general be Majorana type. Let us also give our notation to facilitate further 
discussion: the neutrinos emitted in weak processes such as the beta decay 
or muon decay are weak eigenstates and are not mass eigenstates. The mass 
eigenstates determine how a neutrino state evolves in time. Similarly, in 
the detection process, it is the weak eigenstate that is picked out. This is of 
course the key idea behind neutrino oscillation and the formula presented 
in the last section. To set the notation, let us express the weak eigenstates 
in terms of the mass eigenstates. We will denote the weak eigenstate by 
the symbol a, or simply e, 1,7 etc whereas the mass eigenstate will be 
denoted by the symbols i,j,k etc. To relate the weak eigenstates to the 
mass eigenstates, let us start with the mass terms in the Lagrangian for the 
neutrino and the charged leptons: 


Lm = vE Mur + Ep MER + h.c. (7.19) 


Here the v and E which denote the column vectors for neutrinos and 
charged leptons are in the weak basis. To go to the mass basis, we di- 
agonalize these matrices as follows: 


UTM „Ur = d, (7.20) 
VLM: V} = de 


The physical neutrino mixing matrix is then given by: 
U = VU, (7.21) 


Ua; and relate the two sets of eigenstates (weak and mass) as follows: 


VY, |= UL 2 (7.22) 


394 R. N. Mohapatra 


Using this equation, one can derive the well known oscillation formulae for 
the survival probability of a particular weak eigenstate œ discussed in the 
previous section. 

To see the general structure of the mixing matrix U, let us recall that 
the matrix M, is complex and symmetric and therefore has six complex 
parameters describing it for the case of three generations. But since the 
neutrino is described by a complex field, we can redefine the phases of three 
fields to remove three parameters. That leaves nine parameters. In terms 
of observables, there are three mass eigenvalues (m1, Mm2,m3) and three 
mixing angles and phases in the mixing matrix U. The three phases can 
be split into one Dirac phase, which is analogous to the phase in the quark 
mixing matrix and two Majorana phases. We can then write the matrix U 
as 


U = U| ef (7.23) 


The matrix U) has three real angles 612, 023,013 and a phase. The goal 
of experiments is to determine all nine of these parameters. The knowledge 
of the nine observables allows one to construct the mass matrix for the 
neutrinos and from there one can go in search of the new physics beyond 
the standard model that leads to such a mass matrix. 

The neutrino mass observables given above can be separated into two 
classes: (i) oscillation observables and (ii) non-oscillation observables. The 
first class of observables are those accessible to neutrino oscillation experi- 
ments and are the two mass differences Am?, and Am3}; three mixing angles 
O12 (or 0o); 023 (or 64) and 613 (the reactor angle, also called U-3) and the 
CP phase ô in U). The remaining three observables i.e. the lightest mass 
of the three neutrinos and the two Majorana phases ¢1,2 can only be probed 
by nonoscillation experiments such as Boy decay, beta decay spectrum at 
the endpoint and cosmological observations etc. 


7.3. Neutrino mixing matrix and mass patterns 


A good starting point for the exploration of new physics such as new symme- 
tries, new scaleshidden in neutrino observations is to construct the neutrino 
mass matrix in the basis where the charged leptons are mass eigenstates 
(or near mass eigen states.) One can then look for symmetries which could 
be responsible for this form of the neutrino mass matrix in extensions of 
the standard model. 
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In order to construct the neutrino mass matrix, we will use the following 
experimental numbers: Am? ~ 0.0021 eV?; for solar neutrinos, it gives 
Am?, ~ (7.21—8.63) x 10~° eV?. It also provides information on the angles 
in U which can be summarized by the following mixing matrix (neglecting 
all CP phases): 


G S € 
s+ce C— SE 1 
U= EA 2 Z (7.24) 
S—CE —c—seE 1 
v2 v2 V2 


where from the discussion above e < 0.17; s = sinĝı2 is in the range 
0.267 < sin? 6,2 < 0.371 with the central value being near 0.314. We have 
chosen the atmospheric mixing angle 623 to be maximal. 

A particularly interesting form of the mixing matrix which seems to be 
in accord with data is the so-called tri-bi-maximal form!° where: 


2 Ł 0 
3 8 
Upmns = -5 Fw a (7.25) 
1 1 1 
V6 V3 V2 


As far as the mass pattern goes however, there are three possibilities: 


e (i) normal hierarchy: mı < M2 < m3; 
e (ii) inverted hierarchy: mı ~ —m2 >> mz and 
e (iii) approximately degenerate pattern! mı ~ m2 ~ ms, 


where m; are the eigenvalues of the neutrino mass matrix. In the first 
case, the atmospheric and the solar neutrino data give direct information 
on m3 and mz respectively. On the other hand, in the last case, the mass 
differences between the first and the second eigenvalues will be chosen to fit 
the solar neutrino data and the second and the third to fit the atmospheric 
neutrino data. 


7.3.1. Neutrino mass textures 


From the mixing matrix in Eq. 7.24, we can write down the allowed neutrino 
mass matrix for any arbitrary mass pattern assuming the neutrino is a 
Majorana fermion. Denoting the matrix elements of M, as tag for a, 8 = 
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1, 2,3, we have (Recall that pag = Hga): 
M1 = [m + s*m2+ ema] 


u12 = [—c(s + ce)m, + 8(c — se)m2 + ema] 


H13 = [—c(s — ce)m, — s(c + se)m2 + ems] 
(7.26) 


U22 = G + ce)?m, + (c — se)?m2 + ms] 
u23 = =|- (8? — c2e?2)m, — (c? — 8?e2)m2 + m3] 
1133 = alls — ce)?m, + (c+ se)?m2 + ma]. 


In certain limits for the mixing angles in the above mass matrix, symmetries 
leptons can manifest. Below I give some examples: 


7.3.2. p— T exchange symmetry 


The current observations that 023 is very close to maximal (with 023 = 4 
being the best fit solution in many analyses) and the fact that 613 could 
be vanishing can be understood if the neutrino Majorana mass matix has 
a Zə symmetry that interchanges u — 7.16 The corresponding mass matrix 


is the special case of the matrix below with c= 1, and a= b. 


de” be ac 
M, =\/Amå | be 1e 1 (7.27) 
ae 1 1+ ce 
where a,b,c,d are parameters of order one and e€ ~ = ~ 0.2. Since 
at the moment it is not certain whether u — 7 symmetry is exact or ap- 
proximate, the mass matrix in Eq. 7.27 includes small breaking terms char- 
acterized by a  b and c 4 1. These small departures lead to non-zero 
values for 6;3 and 693 — a which are correlated with each other.!” We have 
ignored leptonic CP violation in this discussion. One can have other ways 
of introducing u — T symmetry breaking using CP phases.'® 
Overall, this symmetry has a good chance be part of the final theory of 
neutrino mixing since there seems to be good experimental support for it. 
This has therefore led to a great deal of model building activity’? most of 
whom predict departures from the exact symmetry limit and can provide 
insight into which way to proceed in assimilating this symmetry as part of 
the quark-lepton world. 
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7.3.3. Mass matriz for tri-bi-maximal mixing and 
associated approximate flavor symmetries 


In the u — 7 symmetric models, the value of the solar mixing angle 012 
remains large but arbitrary. The indication that the value of sin #12 ~ 5 
may be an indication of higher symmetries of the lepton world. This is 
15 Clearly these must have u — T 
symmetry as a subgroup. A typical neutrino mass matrix that leads to the 


called tri-bi-maximal mixing pattern. 


tri-bi-maximal mixing pattern is: 


a b b 
M, = b a+c b-c (7.28) 
b b-c a+c 


where a,b,c are arbitrary parameters. The charged lepton mass matrix is 
chosen to be diagonal. Diagonalizing this matrix leads to the Upmyns in 
the tri-bi-maximal form and the neutrino masses: mı = a — b; mz = a+ 2b 
and mg = a — b + 2c. Clearly if a ~ b < c, we get a normal hierarchy for 
masses. A symmetry that has been found to lead to this mass matrix is 
83.20 

Another way to get the tri-bi-maximal form is to use specific forms 
for the charged lepton and neutrino mass matrices in such a way that the 
combined diagonalization leads to the desired lepton mixing matrix. An 
example is to have a charged lepton mass matrix of the form: 


a 0 0 
M, = 0 a b 
0 b a 


(7.29) 


2i 


where w =e3. 
some symmetries, but remarkably enough it has been shown that this can 


This for may appear much too contrived to arise from 


emerge under certain assumptions if the assumed symmetry is A4,?! which 
is group of even permutations of four elements. 

Detailed theory for all these cases involves many Higgs multiplets and 
probably should not be taken literally but the important message is the 
presence of the hidden symmetry and its implications. Generally such sym- 
metries are hard to grand unify and further research along this direction is 
going to be important. 
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7.3.4. Inverted hierarchy and Le — L, — L, symmetry 


In this sub-section, we consider another interesting clue to model building 
present in neutrino data if the mass arrangement is inverted. It starts with 
the observation that if the neutrino mass matrix has the form 


0 AA 
M, = |A 0 0 (7.30) 
A 0b 0 


this leads to two degenerate neutrinos with mass +/2A and one massless 
neutrino. The atmospheric mass difference is given by Am?, = 2A? and 
mixing angle 64 = 7/4. As far as the solar ve oscillation is concerned, the 
sin? 205 = 1 but Am?, = 0. While this is unphysical, this raises the hope 
that as corrections to this mass matrix are taken into account, it may be 
possible understand the smallness of Am /Amĝ naturally. 

In fact this hope is fortified by the observation that this mass matrix 
has the leptonic symmetry Le — L, — L7; therefore one might hope that as 
this symmetry is broken by small terms, one will end up with a situation 
that fits data well. 

This question was studied in two papers. To proceed with the 
discussion, let us consider the following mass matrix for neutrinos where 
small Le — L, — L- violating terms have been added. 


25,26 


ZG +8 
Mmi e y dl. (7.31) 
s d x 


The charged lepton mass matrix is chosen to have a diagonal form in this 
basis and Le — L, — L, symmetric. In the perturbative approximation, 
there are sum rules involving the neutrino observables and the elements of 
the neutrino mass matrix,?° which first of all imply (i) a close connection 
between the measured value of the solar mixing angle and the neutrino mass 
measured in neutrino-less double beta decay; (ii) the present values for the 
solar mixing angle can be used to predict the mgg for a value of the Am?.. 
For instance, for sin? 204 = 0.9, we would predict (Zee — z) = 0.3. For 
small Am, this implies mgg œ 0.01 eV. This is expected to be within the 
reach of new double beta decay experiments contemplated.?’ In fact now 
there exist more thorough numerical analyses?’ of general Le — L, — L- 
broken models which imply that the inverted hierarchy for neutrinos can 
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be tested in neutrinoless double beta experiments in the next decade or 
so. This way of breaking Le — L, — Lr symmetry also implies a value 
for sin? 2012 > 0.9, which is now almost ruled out. Of course there could 
be other ways of breaking this symmetry using charged lepton sector etc. 
which can still lead to lower solar angle. 

If the value of sin? 265 is ultimately determined to be less than 0.9, the 
question one may ask is whether the idea of L.— L, — L- symmetry is dead. 
The answer is in the negative since so far we have explored the breaking of 
Le — L, — L- symmetry only in the neutrino mass matrix. It was shown 
in? that if the symmetry is broken in the charged lepton mass, one can 
lower the sin? 206 as long as the value of Ues is sizable. However given 
the present upper limit on Ue3, the smallest value is somewhere around 
sin? 205 ~ 0.8. 


7.3.5. Scale invariant mass matrix 


Most of the above forms for the mass matrices are scale dependent in the 
sense that once radiative corrections are taken into account, their forms can 
change. This is specially relevant because in many theories neutrino mass 
matrix is predicted at a high scale due to physics at this scale. They have 
to be extrapolated to the weak scale to compare with experiments. This 
process relies on the nature of physics between the neutrino mass generation 
scale and the weak scale. Thus connection between fundamental physics 
responsible for neutrino masses and observations gets interrupted. Luckily 
the radiative correction effects are not significant if neutrino mass hierarchy 
is normal (see discussion later). However both for inverted and degenerate 
spectra, they are. It is therefore interesting to search for neutrino flavor 
structure that is not affected by such effects. One such form arises when 
the neutrino mass matrices satisfy certain scaling properties.2°> An example 
of such a mass matrix is: 


M, = mo B D D/c |. (7.32) 
B/e D/e DJe 
It is called u — 7 scaling whose most important phenomenological is that 


it leads to an inverted hierarchy with m3 = 0 and Ue3 = 0. Atmospheric 
neutrino mixing is governed by the “scaling factor” c via tan? 023 = 1/c?, 
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i.e., is in general non-maximal because c is naturally of order, but not equal 
to, one. These results are scale independent predictions and do not depend 
on extraneous physics between the neutrino mass generation scale and the 
weak scale. It is interesting to note that current data analyzes (though 
at the present stage statistically not very significant) yield non-maximal 
tan? 023 = 0.89 as the best-fit point.° 


7.3.6. CP violation 


A not very well explored aspect of neutrino physics at the moment is CP 
violation in lepton physics. Unlike the quark sector, CP violation for Ma- 
jorana neutrinos allows for more phases than in the quark sector. Since the 
Majorana neutrino mass matrix is symmetric, for N generations of neutri- 
nos, there are in general NW) phases in it. When the mass matrix is 
diagonalized, these phases will appear in the unitary matrix Uz that does 
the diagonalization (i.e. U'M,Uz, = dp). If we are working in a basis 
where the charged lepton mass matrix is diagonal, then Uz is the leptonic 
weak mixing matrix. As we saw this has N(N + 1)/2 phases. Out of them, 
redefinition of the charged lepton fields in the weak current allows the re- 
moval of N phases; so there are N(N — 1)/2 phases in the neutrino masses. 
In the quark sector, both up and down fields could be redefined allowing 
for the number of physical phases that appear in the end to be smaller. 
However for Majorana neutrinos, redefinition of the fields does not remove 
the phases entirely from the theory but rather shifts them to other places 
where they can manifest themselves physically.2? The detailed discussion 
of CP violation is in other lectures at this school. 


7.4. Neutrino mass scale and physics beyond the standard 
model 


In the standard model (SM), the neutrino mass vanishes to all orders in 
perturbation theory as well as nonperturbatively implying that observa- 
tion of neutrino masses is the first laboratory evidence for physics beyond 
the standard model. To clarify this point, note that the SM is based 
on the gauge group SU(3). x SU(2), x U(1)y under which the quarks 
and leptons, Higgs bosons and gauge bosons transform as described in the 
Table I. 
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Table I 
Field gauge transformation 
Quarks Qz (3,2, 3) 
Righthanded up quarks uz (3, 1, $) 
Righthanded down quarks dR (3,1, -2) 
Lefthanded Leptons L (1,2—1) 
Righthanded leptons er (1,1, —2) 
Higgs Boson H (1,2, +1) 
Color Gauge Fields Ga (8,1,0) 
Weak Gauge Fields W*, Z, y (1,3 + 1,0) 


Table caption: The assignment of particles to the standard model gauge 
group SU (3)e x SU(2)z x U(1)y. 


The electro-weak symmetry SU (2)z x U(1)y is broken by the vacuum 
expectation of the Higgs doublet < H° >= a ~ 186 GeV, which gives 
mass to the gauge bosons. The fermion masses arise from the Yukawa 
couplings: 


Ly = huQLHur+ haQrHdp+heLHer + hc. (7.33) 


when H° acquires a vev. Note that since there are no right handed neutri- 
nos in the theory, there is no term in Eq. 7.33 that can give mass to the 
neutrinos. Thus they remain massless at the tree level. 

There are several questions that arise at this stage. What happens when 
one goes beyond the above simple tree level approximation? Secondly, do 
non-perturbative effects change this tree level result? Finally, how to judge 
whether this result will be modified when the quantum gravity effects are 
included? 

The first and second questions are easily answered by using the B-L 
symmetry of the standard model. The point is that since the standard 
model has no SU(2)r singlet neutrino-like field, the only possible mass 
terms that are allowed by Lorentz invariance are of the form vj,C~!v;r, 
where 7,7 stand for the generation index and C is the Lorentz charge con- 
jugation matrix. Since the viz is part of the SU(2)z doublet field and 
has lepton number +1, the above neutrino mass term transforms as an 
SU (2)z triplet and furthermore, it violates total lepton number (defined as 
L = Le + L, + L+) by two units. However, a quick look at the standard 
model Lagrangian convinces one that the model has exact lepton num- 
ber symmetry after symmetry breaking; therefore such terms can never 
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arise in perturbation theory. Thus to all orders in perturbation theory, the 
neutrinos are massless. As far as the nonperturbative effects go, the only 
known source is the weak instanton effects. Such effects could change the 
result if they broke the lepton number symmetry. One way to see if such 
breaking occurs is to look for anomalies in lepton number current conser- 
vation from triangle diagrams. Indeed it is easy to convince oneself that 
Ja = cWW +c’ BB due to the contribution of the leptons to the triangle 
involving the lepton number current and W’s or B’s. Luckily, it turns out 
that the anomaly contribution to the baryon number current nonconserva- 
tion has also an identical form, so that the B — L current j% _ z, is conserved 
to all orders in the gauge couplings. As a consequence, nonperturbative 
effects from the gauge sector cannot induce B — L violation. Since the 
neutrino mass operator described above violates also B — L, this proves 
that neutrino masses remain zero even in the presence of nonperturbative 
effects. 

Let us now turn to the effect of gravity. Clearly as long as we treat 
gravity in perturbation theory, the above symmetry arguments hold since 
all gravity coupling respect B — L symmetry. However, once nonpertur- 
bative gravitational effects e.g black holes and worm holes are included, 
there is no guarantee that global symmetries will be respected in the low 
energy theory. The intuitive way to appreciate the argument is to note 
that throwing baryons into a black hole does not lead to any detectable 
consequence except through a net change in the baryon number of the uni- 
verse. Since one can throw in an arbitrary number of baryons into the 
black hole, an arbitrary information loss about the net number of missing 
baryons would prevent us from defining a baryon number of the visible 
universe- thus baryon number in the presence of a black hole can not be 
an exact symmetry. Similar arguments can be made for any global charge 
such as lepton number in the standard model. A field theoretic parame- 
terization of this statement is that the effective low energy Lagrangian for 
the standard model in the presence of black holes and worm holes etc must 
contain baryon and lepton number violating terms. In the context of the 
standard model, the only such terms that one can construct are nonrenor- 
malizable terms of the form LHLH/Mp,. After gauge symmetry breaking, 
they lead to neutrino masses; however these masses are at most of order 
v2,/Mpc ~ 1075 eV.°° But as we discussed in the previous section, in order 
to solve the atmospheric neutrino problem, one needs masses at least three 
orders of magnitude higher. 
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Thus one must seek physics beyond the standard model to explain ob- 
served evidences for neutrino masses. While there are many possibilities 
that lead to small neutrino masses of both Majorana as well as Dirac kind, 
here we focus on the possibility that there is a heavy right handed neutrino 
(or neutrinos) that lead to a small neutrino mass. The resulting mechanism 
is known as the seesaw mechanism‘ and leads to neutrino being a Majorana 
particle. 

The nature and origin of the seesaw mechanism can also be tested in 
other experiments and we will discuss them below. This will be dependent 
on the kind of operators that play a role in generating neutrino masses. If 
the leading order operator is of dimension 5, then the scale necessarily is 
very high (of order 101? GeV or greater). On the other hand, in theories 
with extra space dimensions, this operator may be forbidden and one may 
be forced to go to higher dimensional operators, in which case the scale 
could be lower or it could be that neutrino Dirac Yukawa couplings are of 
order 107° (similar to the electron Yukawa coupling in the standard model) 
in which case even the seesaw scale could be in the TeV range. 

The seesaw mechanism raises a very important question: since we re- 
quire the mass of the right handed neutrino to be much less than the Planck 
scale, a key question is “what symmetry keeps the right handed neutrino 
mass lighter?” We will give two examples of symmetries that can do this. 


7.4.1. Seesaw and the right handed neutrino 


The simplest possibility extension of the standard model that leads to 
nonzero mass for the neutrino is one where only a right handed neu- 
trino is added to the standard model. In this case vr, and vg can form 
a mass term; but apriori, this mass term is like the mass terms for charged 
leptons or quark masses and will therefore involve the weak scale. If we 
call the corresponding Yukawa coupling to be Y,, then the neutrino mass 
is mp = Y,v/ v2. For a neutrino mass in the eV range requires that 
Y, ~ 1071! or less which is far below even the small electron Yukawa cou- 
pling of SM. Introduction of such small coupling constants into a theory 
is generally considered unnatural and a sound theory must find a symme- 
try reason for such smallness. As already already alluded to before, seesaw 
mechanism,* where we introduce a singlet Majorana mass term for the right 
handed neutrino is one way to achieve this goal. The effective mass terms 
for the (vL, vr) system is given by (suppressing the generation index): 


Lm = MpvpvR + MpvEC™vR + he. (7.34) 
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where Mp = Y„Vwk, Y, is the lepton doublet coupling to the right handed 
neutrinos (defined as PRY,L) . Suppose we write v spinor in terms of 


its two component spinors as vy = |. ‘: , then vg = ”) and 
ioa N* 0 
v = 9 This gives remembering that See and 
Re oS ion N* jJ” g g P = I 0 
} 0 
ome _ 109 
( 0 —109 
Lm = imvlagN* +iMġN'oN* + hc. (7.35) 
. We can write the neutrino mass matrix as: 
i v 
J = i (vT NT) oM a + h.c. (7.36) 


where M has the form: 


0 mp 
M = 7.37 

(n3 ta) cee 
Since MR is not constrained by the standard model symmetries, it is nat- 
ural to choose it to be at a scale much higher than the weak scale. Now 
diagonalizing this mass matrix , we get a set of heavy eigenstate Nr anda 
set of light eigenstates with mass matrix given by: 


M, ~ -m4 M}'mp (7.38) 


This provides a natural way to understand a small neutrino mass with- 
out any unnatural adjustment of parameters of a theory. In a subsequent 
section, we will discuss a theory which connects the scale Mp to a new 
symmetry of nature beyond the standard model. This formula for neutrino 
masses is called type I seesaw formula. 


7.4.1.1. Why is Mp < Mpe? 


The question “why Mr < Mpg?” is in many ways similar to the question 
in the standard model i.e. “why is MHiggs < Mpe ?” It is well known that 
searches for answer to this latter question has led to many interesting pos- 
sibilities for physics beyond the standard model including supersymmetry, 
extra dimensions and technicolor. It is hoped that answering this question 
for vp can also lead us to new insight into new symmetries beyond the 
standard model. There are two interesting answers to our question that I 
will elaborate later on. 
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If one adds three right handed neutrinos to implement the seesaw mech- 
anism, the model admits an anomaly free new symmetry i.e. B—L. One can 
therefore extend the standard model symmetry to either SU(2) 1, xU(1) tan X 
U(1)s-z or its left-right symmetric extension SU (2)z x SU(2)rxU(1)B_x. 
In either case the right handed neutrino carries the B-L quamtum number 
and its Majorana mass breaks this symmetry. Therefore, the mass of the 
vr can at most be the scale of B — L symmetry breaking, hence answering 
the question “why Mygr < Mpe?”. 


SU(2) a: 


While local B — L is perhaps the most straight forward and natural 
symmetry that keeps vp lighter than the Planck scale, another possibility 
has recently been suggested in Ref. 24. The main observation here is that 
is the standard model is extended by including a local SU(2)4 symme- 
try acting on the first two lepton generations including the right handed 
charged leptons, then global Witten anomaly freedom dictates that there 
must be at least two right handed neutrinos which transform as a doublet 
under the SU(2) 4 local symmetry. In this class of models, in the limit of 
exact SU(2) symmetry, the vp’s are massless and as soon as the SU(2) 4 
symmetry is broken, they pick up mass. Therefore “lightness” of the vp’s 
compared to the Planck scale in these models is related to an SU(2) g sym- 
metry. These comments are elaborated with explicit examples later on in 
this review. 


7.4.2. Double seesaw mechanism with vr and B-L singlet 
neutral fermions 


As we saw from the previous discussion, the conventional seesaw mechanism 
requires rather high mass for the right handed neutrino and therefore a 
correspondingly high scale for B-L symmetry breaking. The right handed 
neutrinos are B-L non-singlet fields. There is however no way at present 
to know what the scale of B-L symmetry breaking is. For lower B-L scale 
models, one must either find a mechanism to suppress the Dirac mass in 
the conventional seesaw formula or extend the theory some other way. A 
particularly simple way is to introduce, B-L singlet heavy neutrinos S and 
use a double seesaw mechanism suggested in Ref. 31 where one writes a 
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three by three neutrino mass matrix in the basis (v, N, S) of the form: 


M = |mp 0 M (7.39) 
0 M wp 


This comes from an effective mass Lagrangian of the form: 
Li, = MmbPLNR + MtNeS +puSPC7'S, + h.c. (7.40) 


It is possible to have extra symmetries that guarantees the above form for 
the Lagrangian. (It will be a good exercise to discover these symmetries.) 
For the case u « M ~ Mp-z, (where Mp-z is the B — L breaking scale) 
this matrix has one light and two heavy neutrinos per generation and the 
latter two form a pseudo-Dirac pair with mass of order Mg-z. The impor- 
tant thing for us is that the light mass eigenvalue is given by mĉ& u/M?; for 
mp % u ~ GeV, a 10 TeV B — L scale is enough to give neutrino masses 
in the eV range. For the case of three generations, the formula for the light 
neutrino mass matrix is given by: 


M, = m§M-!pM- mp. (7.41) 


7.4.3. High mass Higgs triplet induced neutrino masses 


As already noted, one way to generate nonzero neutrino masses without 
using righthanded neutrinos is to extend the standard model by the addition 
of an SU(2),, triplet Higgs field with Y = 2 so that the electric charge profile 
of the members of the multiplet is given as follows: (At*, At, A°). This 
allows an additional Yukawa coupling of the form fz LTrərL.A, where the 
A? couples to the neutrinos. Clearly A field has L = 2. When A? field 
has a nonzero vev, it breaks lepton number by two units and leads to 
Majorana mass for the neutrinos. There are two questions that arise now: 
one, how does the vev arise in a model and how does one understand the 
smallness of the neutrino masses in this scheme. There are two answers to 
the first question: One can maintain exact lepton number symmetry in the 
model and generate the vev of the triplet field via the usual “mexican hat” 
potential. There are two problems with this case. This leads to the triplet 
Majoron which has been ruled out by LEP data on Z-width. In any case, 
in this model smallness of the neutrino mass is not naturally understood. 
Another way to generate the induced vev is to keep a large but positive 
mass (Ma) for the triplet Higgs boson and allowing for a lepton number 
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violating coupling MA*HH.? In this case, minimization of the potential 


induces a vev for the A? field when the doublet field acquires a vev"®: 
Mov? 
= 0 wk 
vr =< A >= A (7.42) 


Since the mass of the A field is invariant under SU(2)z x U(1)y, it can be 
very large connected perhaps with some new scale of physics. If we assume 
that Ma ~ M ~ 10! GeV or so, we get vr ~ eV. Now in the Yukawa 
coupling f,L7727L.A, since the A? couples to the neutrinos, its vev leads 
to a neutrino mass We will see later when we discuss the seesaw models 
that unlike those models, the neutrino mass in this case is not hierarchically 
dependent on the charged fermion masses. Note further the high mass 
suppression in Eq. 7.42 leading to a new kind of seesaw suppression. This 
is called type II seesaw. 


7.5. Left right symmetric unification: a natural realization 
of the seesaw 


Let us now explore the implications of including the righthanded neutrinos 
into the extensions of the standard model to understand the small neu- 
trino mass by the seesaw mechanism. As already emphasized, if we assume 
that there are no new symmetries beyond the standard model, the right 
handed neutrino will have a natural mass of order of the Planck scale mak- 
ing the light neutrino masses too small to be of interest in understanding 
the observed oscillations. We must therefore search for new symmetries 
that can keep the RH neutrinos at a lower scale than the Planck scale. A 
new symmetry always helps in making this natural. 

To study this question, let us note that the inclusion of the right handed 
neutrinos transforms the dynamics of the gauge models in a profound way. 
To clarify what we mean, note that in the standard model (that does not 
contain a vp) the B-— L symmetry is only linearly anomaly free i.e. Tr[(B— 
L)Q?] = 0 where Qa are the gauge generators of the standard model but 
Tr(B — L)? # 0. This means that B — L is only a global symmetry and 
cannot be gauged. However as soon as the vp is added to the standard 
model, one gets Tr{(B — L)?] = 0 implying that the B-L symmetry is now 
gaugeable and one could choose the gauge group of nature to be either 
SU(2)r x U(1)ise x U(1)s-L or SU(2)r x SU(2)r x U(1)B-L, the latter 
being the gauge group of the left-right symmetric models.?” Furthermore 
the presence of the vr makes the model quark lepton symmetric and leads 
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to a Gell-Mann-Nishijima like formula for the electric charges’? i.e. 


B-L 
Q = Isr +Isr+ =e (7.43) 


The advantage of this formula over the charge formula in the standard 
model charge formula is that in this case all entries have a physical meaning. 
Furthermore, it leads naturally to Majorana nature of neutrinos as can be 
seen by looking at the distance scale where the SU(2), x U(1)y symmetry 
is valid but the left-right gauge group is broken. In that case, one gets 


AQ =0=Alzz: (7.44) 


We see that if the Higgs fields that break the left-right gauge group carry 
righthanded isospin of one, one must have |AL| = 2 which means that the 
neutrino mass must be Majorana type and the theory will break lepton 
number by two units. 

Let us now proceed to give a few details of the left-right symmetric 
model and demonstrate how the seesaw mechanism emerges in this model. 

The gauge group of the theory is SU(2)z x SU(2)r x U(1)gs-z with 
quarks and leptons transforming as doublets under SU(2)z, r. In Table I, 
we denote the quark, lepton and Higgs fields in the theory along with their 
transformation properties under the gauge group. 


Table II 
Fields SU(2)r x SU(2)R x U(1)s-L 
representation 

QL (2,1,+3) 

QR (1,2,3) 

Lt (2,1,—1) 

Lr (1,2,—1) 

o) (2,2,0) 

Ar (3,1,+ 2) 

Ar (1,3,4 2) 
Table caption Assignment of the fermion and Higgs fields to the repre- 
sentation of the left-right symmetry group. 


The first task is to specify how the left-right symmetry group breaks to 
the standard model i.e. how one breaks the SU(2)r x U(1)B_1 symmetry 
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so that the successes of the standard model including the observed pre- 
dominant V-A structure of weak interactions at low energies is reproduced. 
Another question of naturalness that also arises simultaneously is that since 
the charged fermions and the neutrinos are treated completely symmetri- 
cally (quark-lepton symmetry) in this model, how does one understand the 
smallness of the neutrino masses compared to the other fermion masses. 

It turns out that both the above problems of the LR model have a 
common solution. The process of spontaneous breaking of the SU(2)r 
symmetry that suppresses the V+A currents at low energies also solves the 
problem of ultralight neutrino masses. To see this let us write the Higgs 
fields explicitly: 


ao (YF Bae (29) os 


All these Higgs fields have Yukawa couplings to the fermions given symbol- 
ically as below. 
Ly =hLpoLe + həLrýLr 
+ hi QroQr + hoQroQr 
+ f(LrLr Ar + LRLRAR) + hc. (7.46) 


The SU(2)r x U(1)B-z is broken down to the standard model hyper- 
charge U(1)y by choosing < A% >= vr # 0 since this carries both 
SU(2)r and U(1)p_r quantum numbers. It gives mass to the charged 
and neutral righthanded gauge bosons i.e. My, = gvr and Mz = 
V2guR cos Oy /\/cos20w. Thus by adjusting the value of vr one can sup- 
press the right handed current effects in both neutral and charged current 
interactions arbitrarily leading to an effective near maximal left-handed 
form for the charged current weak interactions. 

The fact that at the same time the neutrino masses also become small 
can be seen by looking at the form of the Yukawa couplings. Note that the 
f-term leads to a mass for the right handed neutrinos only at the scale vp. 
Next as we break the standard model symmetry by turning on the vev’s 
for the ¢ fields as Diag < ¢ >= (k, K’), we not only give masses to the Wz 
and the Z bosons but also to the quarks and the leptons. In the neutrino 
sector the above Yukawa couplings after SU(2), breaking by < ¢ >4 0 
lead to the so called Dirac masses for the neutrino connecting the left and 
right handed neutrinos. In the two component neutrino language, this leads 
to the following mass matrix for the v, N using the notation earlier with 
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the four component v = 0 a ) 
togN 


0 hk 
we (9 t) fa 


Note that mp in previous discussions of the seesaw formula (see Eq. ()) is 
given by mp = hk, which links it to the weak scale and the mass of the RH 
neutrinos is given by Mr = fur, which is linked to the local B-L symmetry. 
This justifies keeping RH neutrino mass at a scale lower than the Planck 
mass. It is therefore fair to assume that seesaw mechanism coupled with 
observations of neutrino oscillations are a strong indication of the existence 
of a local B-L symmetry far below the Planck scale. 

By diagonalizing this 2 x 2 matrix, we get the light neutrino eigenvalue 
to be m, ~ hey and the heavy one to be fur. Note that typical charged 
fermion masses are given by h’k etc. So since vg > k,k’, the light neutrino 
mass is automatically suppressed. This way of suppressing the neutrino 
masses is called the seesaw mechanism.* Thus in one stroke, one explains 
the smallness of the neutrino mass as well as the suppression of the V+A 
currents. 

In deriving the above seesaw formula for neutrino masses, it has been 
assumed that the vev of the lefthanded triplet is zero so that the vzvz, entry 
of the neutrino mass matrix is zero. However, in most explicit models such 
as the left-right model which provide an explicit derivation of this formula, 
there is an induced vev for the Ae of order < A? >= vr & ak In the 
left-right models, this this arises from the presence of a coupling in the 
Higgs potential of the form A LAR gi. In the presence of the Az vev, the 
seesaw formula undergoes a fundamental change and takes the form 


2 
M, = fvr- hi frh (=) (7.48) 


which includes both the type I and the type II seesaw contributions. In 
Fig. 1, the two mdiagrams responsible for type I and type II seesaw are 
given: 


This left-right symmetric seesaw formula has recently been shown to 
exhibit some interesting duality properties which can perhaps be used to 
restrict some of the arbitrariness in its applications. 
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Fig. 7.1. The type I and type II contribution to seesaw formula for neutrino masses. 


Note that in the type I seesaw formula, what appears is the square of 
the Dirac neutrino mass matrix which in general expected to have the same 
hierarchical structure as the corresponding charged fermion mass matrix. 
In fact in some specific GUT models such as SO(10), Mp = Mu. This 
is the origin of the common statement that neutrino masses given by the 
seesaw formula are hierarchical i.e. my, < My, < M,, and even a more 
model dependent statement that my, : My, : Mv, = M3 : MÊ : mẹ. 

On the other hand if one uses the type II seesaw formula, there is no 
reason to expect a hierarchy and in fact if the neutrino masses turn out 
to be degenerate as discussed before as one possibility, one possible way to 
understand this may be to use the type II seesaw formula. 

Secondly, the type II seesaw formula is a reflection of the parity in- 
variance of the theory at high energies. Evidence for it would point more 
strongly towards left-right symmetry at high energies. 


7.5.1. Understanding detailed mixing pattern for neutrinos 
using the seesaw formula 


Let us now address the question: to what extent one can understand the 
details of the neutrino masses and mixings using the seesaw formulae. The 
answer to this question is quite model dependent. While there exist many 
models which fit the observations, none (except a few) are completely pre- 
dictive and almost always they need to invoke new symmetries or new 
assumptions. The problem in general is that the seesaw formula of type 
I, has 12 parameters in the absence of CP violation (six parameters for a 
symmetric Dirac mass matrix and six for the Mp) which is why its predic- 
tive power is so limited. In the presence of CP violation, the number of 
parameters double making the situation worse. Specific predictions can be 
made only under additional assumptions. 
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For instance, in a class of seesaw models based on the SO(10) group that 
embodies the left-right symmetric unification model or the SU (4)-color, the 
mass the tau neutrino mass can be estimated provided one assumes the nor- 
mal mass hierarchy for neutrinos and a certain parameter accompanying a 
higher dimensional operator to be of order one. To see this, let us assume 
that in the SO(10) theory, the B-L symmetry is broken by a 16-dim. Higgs 
boson. The RH neutrino mass in such a model arises from the nonrenormal- 
izable operator A(16f 16g )?/Mpv. In a supersymmetric theory, if 16-Higgs 
is also responsible for GUT symmetry breaking, then after symmetry break- 
ing, one obtains the RH neutrino mass Mg ~ A(2 x 1016)? /Mpe ~ 4104 
GeV. In models with SU (4)e symmetry, my, ,D ~ m:(Mu) ~ 100 GeV. 
Using the seesaw formula then, one obtains for A = 1, tau neutrino mass 
My, œ 0.025 eV, which is close to the presently preferred value of 0.05 eV. 
The situation with respect to other neutrino masses is however less certain 
and here one has to make assumptions. 

The situation with respect to mixing angles is much more complicated. 
In generic seesaw models, one needs additional family symmetries to un- 
derstand the largeness of both solar and atmospheric mixing angles, as has 
been commented before. It could of course very well be that the Dirac 
coupling in the seesaw formula is similar to the quark Yukawas but large 
neutrino mixings owe their origin to the flavor structure of right handed 
neutrino mass matrix. Or it could be that it is the type II seesaw term 
(the triplet Higgs contribution) dominates the neutrino mass decoupling 
neutrino masses completely from the charged lepton and quark mixings. 

Essentially, one has to arrive at matrices similar to the above examples. 
There are however some exceptional situations such as in a class of minimal 
SO(10) models described below where the overall unification constraints on 
Yukawa textures is enough to explain desired large mixings, without the 
need for any family symmetry. 


7.5.2. General consequences of the seesaw formula for neu- 
trino masses 


In this section, we will consider some implications of the seesaw mechanism 
for understanding neutrino masses. We will discuss two main points. One 
is the nature of the right handed neutrino spectrum as dictated by the 
seesaw mechanism and secondly, ways to get an approximate Le — Ly, — 
L, symmetric neutrino mass matrix using the seesaw mechanism and its 
possible implications for physics beyond the standard model.33 
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For this purpose, we use the type I seesaw formula along with the as- 
sumption of a diagonal Dirac neutrino mass matrix to obtain the right 
handed neutrino mass matrix Mp: 


MRij = mMpiHy MD,j (7.49) 
with 
—1 2 2 
Hy = T+ = a 
-1_ e(s+ce)  s(c—se) € 


(7.50) 


H22 = Om, 2m3 2m3 
2 (s?—c?e?) (2-8) 1 
H23 = Im, 2mMe 2m3 
1 (s—ce)? (c+ se)? 1 
H33 = 2m, 2M2 2M3 


Since for the cases of normal and inverted hierarchy, we have no information 
on the mass of the lightest neutrino mı, we could assume it in principle 
to be quite small. In that case, the above equation enables us to conclude 
that quite likely one of the three right handed neutrinos is much heavier 
than the other two, leading to the so-called two right handed neutrino 
dominance model.3? The situation is of course completely different for the 
degenerate case. This kind of separation of the RH neutrino spectrum is 
very suggestive of a symmetry. In fact we have recently argued that,?^ this 
indicates the possible existence of an SU(2)# horizontal symmetry, that 
leads in the simplest case to an inverted mass pattern for light neutrinos. 
A scenario which realizes this is given below. 


7.5.2.1. Approzimate Le — L, — Lr symmetric mass matrix from 
seesaw 


In this section, we discuss how an approximate Le — L, — Lr, symmetric 
neutrino mass matrix may arise within a seesaw framework. Consider a 
simple extension of the standard model by adding two additional singlet 
right handed neutrinos,?* N1, No assigning them Le — L, — L, quantum 
numbers of +1 and —1 respectively. Denoting the standard model lepton 
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doublets by Ye,u,r, the Le — Lu — Lr symmetry allows the following new 
couplings to the Lagrangian of the standard model: 


L! = (hat, + hopu) HN + hideHNi + MN] C7'No+h.c. (7.51) 


where H is the Higgs doublet of the standard model; C~! is the Dirac charge 
conjugation matrix. We add to it the symmetry breaking mass terms for 
the right handed neutrinos, which are soft terms, i.e. 


Le = (Mı NI CN, + MoNZC No) + h.c. (7.52) 


with e < 1. These terms break Le — L, — L- by two units but since they 
are dimension 3 terms, they are soft and do not induce any 

with e < 1. These terms break Le — L, — L, by two units but since 
they are dimension 3 terms, they are soft and do not induce any new terms 
into the theory. 

It is clear from the resulting mass matrix for the vz, N system that the 
linear combination hav; —h3y,,) is massless and the atmospheric oscillation 
angle is given by tanĝa = ho/hg3; for h3 ~ h2, the 64 is maximal. The see- 
saw mass matrix then takes the following form (in the basis (ve, Da, Ni, N2) 
with Ò, = hav, + h3v,): 


0 0 m4 0 
0 0 0 me 
M = ; 
My 0 eM, M (7 a5) 


0 m M eM 


The diagonalization of this mass matrix leads to the mass matrix of the 
form discussed before. 


7.5.2.2. Tri-bi-mazimal mixing from seesaw 


In this section, we present an example of a seesaw model for the mass matrix 
for neutrinos (Eq. 7.28) that leads to the tri-bi-maximal mixing.” It was 
shown that the Majorana neutrino mass matrix in 7.28 can be realized in a 
combined type I type II seesaw model with soft-broken S3 family symmetry 
for leptons. The type II contribution comes from an S3 invariant coupling 
of lepton doublets to the triplet field A i.e. fagLoaLgA. The most general 
S3 invariant form for f is: 


fa fo fo 
f= |fo fa fo (7.54) 
fo fe fa 
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After the triplet Higgs field A gets vev and decouples, its contribution to 
the light neutrino mass can written as 


a YV vY 
Myr = b a’ b (7.55) 
b' b a! 


where a’ = nien BA fa and b = asint ga fo. We denote Mr as the mass 
of the triplet Higgs and A as the coupling constant between the triplet and 
doublets in the superpotential. 

Coming to the type I contribution, the Dirac mass matrix for neutrinos 
comes from an $3 invariant Yukawa coupling of the form: 


Lp = hy [PRIH (Le = Ly) ae Pro (Ly = L,) 


+ VrR3H(L, — Le)| + h.c. (7.56) 
leading to 
h -h 0 
Y,= | 0 h =h]. (7.57) 
—h 0 h 


In the limit of |Mri.r3| > |Mre|, where a single right-handed neutrino 
dominates the type I contribution, the mixed type I+II seesaw formula 


M, = Mrr- M} M;pMp, (7.58) 


then leads to Eq. 7.28 which gives the tri-bi-maximal mixing matrix. It 
turns out that,°° the charged lepton mass matrix in this case can be made 
diagonal if one of the two lepton Yukawa couplings is set to zero. 


7.6. Neutrino mass and grand unification 


One of the interesting features of the seesaw mechanism is that if one as- 
sumes the Dirac masses to be roughly of order of the up-quark masses, then 
the atmospheric neutrino mass difference would directly measure the mass 
of mg and can be used to get a rough idea of how high the seesaw scale is. 
In order to do this one can use the rough relation my, ~ me which then 
yields Mp ~ 1014 GeV. This is of course not a rigorous argument at all and 
can therefore only be used as a suggestive one. If however one takes this 
seriously, then it suggests that the seesaw scale could be related to the scale 
of grand unification which from arguments of coupling constant unification 
is also of order 101° GeV. In view of other theoretical arguments in favor 
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of GUTs, one may try to understand the neutrino masses within a grand 
unified theory framework. 

The minimal GUT group that appears to have many desirable proper- 
ties is the SO(10) group,*+ whose spinor representation is 16 dimensional 
and is just right for all then SM fermions of one generation plus the right 
handed neutrino needed for implementing the seesaw mechanism. This 
has therefore been extensively studied as a way to understand neutrino 
properties. 


7.6.1. SO(10) Grand Unification of seesaw mechanism and 
predictions for neutrino masses 


In addition to the fermion unification by the 16 dimensional spinor repre- 
sentation, SO(10) contains the B-L as a subgroup and seesaw mechanism 
requires that the process of symmetry breaking down to the standard model 
must break the B-L at a high scale. One implication of this is a natural 
understanding of the seesaw scale as being connected to the GUT scale. 
Secondly in the context of supersymmetric SO(10) models, the way B-L 
breaks has profound consequences for low energy physics. For instance, if 
B-L is broken by a Higgs field belonging to the 16 dimensional Higgs field, 
then the field that acquires a nonzero vev has the quantum numbers of 


the vp field i.e. B-L breaks by one unit. In this case higher dimensional 
operators of the form YYYY py will lead to R-parity violating operators 
in the effective low energy MSSM theory such as QLd*,u°d°d° etc which 
can lead to large breaking of lepton and baryon number symmetry and 
hence unacceptable rates for proton decay. This theory also has no dark 
matter candidate without making additional assumptions. Furthermore, 
since non-renormalizable operators are an essential part of this approach, 
there are many more parameters, making it non-predictive in the absence 
of additional assumptions.*” 

On the other hand, one may break B-L by a 126 dimensional Higgs 
field.43-44 The member of this multiplet that acquires vev has B— L = 2 and 
leaves R-parity as an automatic symmetry of the low energy Lagrangian. 
This then gives a naturally stable dark matter. Furthermore, in this ap- 
proach, since one considers only renormalizable couplings, the number of 
Yukawa parameters are quite limited so that the model is quite predic- 
tive.* The predictivity clearly arises from one irreducible 16 dimensional 
spinor multiplet containing all fermions of each family or complete fermion 
unification. 
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In order to study the predictions of the model, we first note that since 
the SO(10) model contains the left-right subgroup, the seesaw formula takes 
the modified form as in Eq.7.48 that we repeat below.36 


2 
M, = for -hl frh (2x) (7.59) 
UR 


It turns out that if the B-L symmetry is broken by 16 Higgs fields, the first 
term in the type II seesaw (effective triplet vev induced term) becomes very 
small compared to the type I term. On the other hand, if B-L is broken 
by a 126 field, then the first term in the type II seesaw formula is not 
necessarily small and can in principle dominate in the seesaw formula. As 
we discuss below, this leads to predictions for neutrino masses and mixings 
that are in excellent agreement with experiments. 

The basic ingredients of this model*® are that one considers only two 
Higgs multiplets that contribute to fermion masses i.e. one 10 and one 126. 
A unique property of the 126 multiplet is that it not only breaks the B-L 
symmetry and therefore contributes to right handed neutrino masses, but 
it also contributes to charged fermion masses by virtue of the fact that 
it contains MSSM doublets which mix with those from the 10 dimensional 
multiplets and survive down to the MSSM scale. This leads to a tremendous 
reduction of the number of arbitrary parameters. 

There are only two Yukawa coupling matrices in this model: (i) h for 
the 10 Higgs and (ii) f for the 126 Higgs. SO(10) has the property that 
the Yukawa couplings involving the 10 and 126 Higgs representations are 
symmetric. Therefore if we assume that CP violation arises from other 
sectors of the theory (e.g. squark masses) and work in a basis where one 
of these two sets of Yukawa coupling matrices is diagonal, then it will have 
only nine parameters. Noting the fact that the (2,2,15) submultiplet of 126 
has a pair of standard model doublets that contributes to charged fermion 


masses, one can write the quark and lepton mass matrices as follows**: 


My = hkut fru 

Ma = hka or fva 

Me = hka = 3fva 
Mop = hku — 3fvu 


(7.60) 


where Ky,q are the vev’s of the up and down standard model type Higgs 
fields in the 10 multiplet and v,,,¢ are the corresponding vevs for the same 
doublets in 126. Note that there are 13 parameters in the above equations 
and there are 13 inputs (six quark masses, three lepton masses and three 
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quark mixing angles and weak scale). Thus all parameters of the model 
that go into fermion masses are determined. 

To determine the light neutrino masses, we use the seesaw formula in 
Eq. 7.48, where the f is nothing but the 126 Yukawa coupling. Thus all 
parameters that give neutrino mixings except an overall scale are deter- 
mined. A simple way to see how large mixings arise in this model is to 
note that when the triplet term dominates the seesaw formula, we have 
the neutrino mass matrix M, « f, where f matrix is the 126 coupling to 
fermions discussed earlier. 


M, = c(Ma— Me) (7.61) 
All the quark mixing effects are then in the up quark mass matrix i.e. 


Mu = UVexuMiUcK.- Note further that the minimality of the Higgs 
content leads to the following sum-rule among the mass matrices: 


kM z= rMa + Mau (7.62) 
where the tilde denotes the fact that we have made the mass matrices 


dimensionless by dividing them by the heaviest mass of the species. We 
then find that we have 


AS te A3 
Mae ®% m| X X X (7.63) 
A3 A? 1 


where À ~ 0.22 and the matrix elements are supposed to give only the 
approximate order of magnitude. An important consequence of the relation 
between the charged lepton and the quark mass matrices in Eq. 7.62 is 
that the charged lepton contribution to the neutrino mixing matrix i.e. 
Up ~ 1 + O(A) or close to identity matrix. As a result the neutrino mixing 
matrix is given by Upyyng = ULU, ~ U,, since in Up, all mixing angles 
are small. Thus the dominant contribution to large mixings will come from 
U,, which in turn will be dictated by the sum rule in Eq. 7.61. 

As we extrapolate the quark masses to the GUT scale, due to the fact 
that my, — M, ~% m,? for a wide range of values of tang, the neutrino mass 
matrix M, =c(Ma— Me) takes roughly the form 


Xe Oe Oe 
M, = c(Ma = Me) ~ mo A3 A2 A2 (7.64) 
Ne De. OX? 


It is easy to see that both the 012 (solar angle) and 623 (the atmospheric 
angle) are now large. The detailed magnitudes of these angles of course 
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depend on the details of the quark masses at the GUT scale. Using the 
extrapolated values of the quark masses and mixing angles to the GUT 
scale, the predictions of this model for various oscillation parameters are 
given in.?° Some of the salient features are: (i) the atmospheric mixing 
angle 023 is not maximal and the maximum value for it is around 38°; 
(ii) the prediction for sinf13 = U-3 is near 0.18, a value within the reach 
of MINOS as well as other planned Long Base Line neutrino experiments 
such as Numi-Off-Axis, JPARC etc. 


7.6.2. CP violation in the minimal SO(10) model 


In the discussion given above, it was assumed that CP violation is non- 
CKM type and resides in the soft SUSY breaking terms of the Lagrangian. 
The overwhelming evidence from experiments seem to be that CP viola- 
tion is perhaps of CKM type with CKM phase of about 60°. Success of the 
above approach in understanding neutrino mixings suggests that we should 
consider extending the above simple model to accommodate CKM CP vi- 
olation. Several such attempts have been made in recent literature. 1947 

One approach discussed by usł” employs a slight extension of the 
10+126 model by adding a 120 Higgs field. A further Zə symmetry is 
imposed in such a way that the 10 and 126 couplings are real whereas the 
120 couplings turn out to be imaginary. This will add a new piece to all 
fermion masses but in such a way that the b—7 mass convergence still leads 
to large atmospheric mixing as in the 10+126 case. 

The new model is still predictive in the neutrino sector. Of the three 
new parameters, one is determined by the CP violating quark phase. the 
two others are determined by the solar mixing angle and the solar mass 
difference squared. Therefore we lose the prediction for these parameters. 
However, we can predict in addition to 04 which is now close to maximal, 
613 > 0.1 (see figure below) and the Dirac phase for the neutrinos, also 
close to 90°. 

In the above discussion, we assumed type II seesaw terms to dominate. 

This model has been reanalyzed using type I seesaw term to dominant 
in a recent paper!? and a fit to the fermion masses as well as neutrino 
mixings exists for this case. 

Finally, a few comments on what really will constitute a true test of 
the grand unification theories: a key prediction of simple grand unified 
theories such as SU(5) and SO(10) is the existence of proton decay. In 
supersymmetric theories, proton decay turns from an exciting prediction to 
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somewhat of a challenge since the presence of super-partners at the TeV 
scale generates “dangerous” operators such as QQQL /Mv which could lead 
to very rapid proton decay. In fact it is the appearance of these kind of 
operators that has ruled out minimal SUSY SU(5) model. For this reason, 
in the last two papers by us,*” a Yukawa texture was chosen that is in 
accord with current experimental bounds on proton lifetime resulting from 
dimension five operators as the one given above and a fit to neutrino masses 
as well as charged fermions etc was found with type II seesaw. A proton 
decay check is therefore needed for the type I fit to fermions carried out in 
Ref. 48 and proton decay predictions for the model of*” need to be worked 
out. 


7.6.3. Type II seesaw and Quasi-degenerate neutrinos 


In this subsection we like to discuss some issues related to the degenerate 
neutrino hypothesis, which will be necessary if there is evidence for neutri- 
noless double beta decay at a significant level(see for example the recent 
results from the Heidelberg-Moscow group!*) and assuming that no other 
physics such as R-parity breaking or doubly charged Higgs etc are not the 
source of this effect). Thus it is appropriate to discuss how such models 
can arise in theoretical schemes and how stable they are under radiative 
corrections. 

There are two aspects to this question: one is whether the degeneracy 
arises within a gauge theory framework without arbitrary adjustment of 
parameters and the second aspect being that given such a degeneracy arises 
at some scale naturally in a field theory, is this mass degeneracy stable under 
renormali9zation group extrapolation to the weak scale where we need the 
degeneracy to be present. In this section we comment on the first aspect. 

It was pointed out long ago! that degenerate neutrinos arise naturally 
in models that employ the type II seesaw since the first term in the mass 
formula is not connected to the charged fermion masses. One way that has 
been discussed is to consider schemes where one uses symmetries such as 
SO(3) or SU(2) or permutation symmetry S4 so that the Majorana Yukawa 
couplings f; are all equal. This then leads to the dominant contribution to 
all neutrinos being equal. This symmetry however must be broken in the 
charged fermion sector in order to explain the observed quark and lepton 
masses. Such models consistent with known data have been constructed 
based on SO(10) as well as other groups. The interesting point about the 
SO(10) realization is that the dominant contributions to the Am?’s in this 
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model comes from the second term in the type II seesaw formula which in 
simple models is hierarchical. It is of course known that if the MSW solution 
to the solar neutrino puzzle is the right solution (or an energy independent 


solution), then we have Am2,,,,.< AMÊ rmos. In fact if we use the fact 
2 


true in SO(10) models that Mu = Mp, then we have Am4ryos ~ mozi 
and Amiorar © mo = where mo is the common mass for the three 
neutrinos. It is interesting that for mo ~ few eV and fur ~ 10!° GeV, 
both the Am?’s are of right order to the required values. 

Outside the seesaw framework, there could also be electroweak symme- 
tries that guarantee the mass degeneracy. 

The second question of stability under RGE of such a pattern is dis- 


cussed in a subsequent section. 


7.7. Some other consequences of seesaw paradigm 


Generic seesaw models have several other important implications that we 
go into now. For simplicity, we first consider the type I seesaw formula7.38. 
The first question one can ask is that given low energy information, to what 
extent we can discover the high scale physics associated with the seesaw 
mechanism such as the spectrum of right handed neutrinos, the structure of 
the Dirac mass matrix mp (or equivalently Y,). A simple parameter count- 
ing shows that the neutrino masses and mixings (including CP phases) are 
characterized by nine observables whereas seesaw formula involves eighteen 
parameters (in the basis where RH neutrinos are mass eigenstates, there 
are three masses and 15 parameters characterizing mp. Thus we need nine 
more pieces of low energy inputs to completely determine the seesaw physics 
(granted that all observables in the neutrino mass matrix are determined). 
Radiative leptonic decays such as u — e+ 7,7 — u,e + y including both 
CP violating and conserving channels could provide six pieces of informa- 
tion; three electric dipole moments of the charged leptons could provide 
the remaining three. Thus in principle, all the seesaw parameters could be 
determined from low energy observations. 

In discussing the connection between high scale and low scale physics 
for neutrinos, it is often convenient to use a parameterization suggested by 
Casas and Ibarra.’ 


Yitwr = iMp¥?0(M?2)/?uUt (7.65) 


where O is a complex matrix with the property that OOT = 14° and U is 
the neutrino mixing matrix; M@ is the diagonal neutrino mass matrix. The 
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set of matrices O in fact form a group analogous to the complex extension 
of the Lorentz group. Note that six parameters (or three complex angles) 
characterize O, three needed each for Mp and M, and six for U giving 
a total of 18 as we counted above. In special cases where there are sym- 
metries e.g. y — T symmetry, the number of complex angles in reduces to 
only one making the direct connection between high and low energy phases 
somewhat closer. 
For the case of type II seesaw, the corresponding relation is: 


Yvwk = iMr’ O[U*M8U' — Mr? (7.66) 


where ¢ = TA 

It is clear from Eq.7.65 that in general the neutrino mixing matrix is 
only indirectly related to the details of Y, due to the unknown matrix O. 
In a given model however, when Y, is given, O and U get related. 


7.7.1. SUSY seesaw and lepton flavor violation 


In the standard model, the masslessness of the neutrino implies that that 
there is no lepton flavor changing effects unlike in the quark sector. Thus 
the leptons are completely “flavor sterile” and do not throw any light on 
the flavor puzzle. Once one includes the right handed neutrinos Nr one 
for each family, there is lepton mixing and this activates the lepton flavor. 
A simple phenomenological consequence of this “flavor activation” is that 
there appear lepton flavor changing effects such as y > e+ 9, T—>e,u+y 
etc. However, a simple estimate of the one loop contribution to such effects 
shows that the amplitude is of order 

eG Fm Lj mem? 


sig ug (7.67) 


Mw 


A(e j? li + y) a 

T 

This leads to an unobservable branching ratio (of order ~ 1074?) for the 
rare radiative decay modes for the leptons given above. 

The situation however changes drastically as soon as the seesaw mech- 
anism for neutrino masses is combined with supersymmetry. It has been 
noted in many papers already that in supersymmetric theories, the lepton 
flavor changing effects get significantly enhanced. They arise from the the 
mixings among sleptons (superpartners of leptons) of different flavor caused 
by the renormalization group extrapolations which via loop diagrams lead 
to lepton flavor violating (LFV) effects at low energies.°° 

The way this happens is as follows. In the simplest N=1 supergravy- 
ity models,°?! the supersymmetry breaking terms at the Planck scale are 
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taken to have only few parameters: a universal scalar mass mo, universal 
A terms, one gaugino mass ™ 1/2 for all three types of gauginos. Clearly, 
a universal scalar mass implies that at Planck scale, there is no flavor vi- 
olation anywhere except in the Yukawa couplings (or when the Yukawa 
terms are diagonalized, in the CKM angles). However as we extrapolate 
this theory to the weak scale, the flavor mixings in the Yukawa interactions 
induce non universal flavor violating scalar mass terms (i.e. flavor violating 
slepton and squark mass terms). In the absence of neutrino masses, the 
Yukawa matrices for leptons can be diagonalized so that there is no flavor 
violation in the lepton sector even after extrapolation down to the weak 
scale. On the other hand, when neutrino mixings are present or when the 
quarks and leptons are unified in such a way that this diagonalization be- 
comes impossible, there is no basis where all leptonic flavor mixings can be 
made to disappear. In fact, in the most general case, of the three matrices 
Ye, the charged lepton coupling matrix, Y,, RH neutrino Yukawa coupling 
and My,, the matrix characterizing the heavy RH neutrino mixing, only 
one can be diagonalizd by an appropriate choice of basis and the flavor 
mixing in the other two remain. In a somewhat restricted case where the 
right handed neutrinos do not have any interaction other than the Yukawa 
interaction and an interaction that generates the Majorana mass for the 
right handed neutrino, one can only diagonalize two out of the three matri- 
ces (i.e. Y,, Yọ and Mr). Thus there will always be lepton flavor violating 
terms in the basic Lagrangian, no matter what basis one chooses. These 
LFV terms can then induce mixings between the sleptons of different flavor 
and lead to LFV processes. If we keep the Me diagonal by choice of basis, 
searches for LFV processes such as T > u + y and/or u —> e+ y can throw 
light on the RH neutrino mixings/or family mixings in Mp, as has already 
been observed. 

Since in the absence of CP violation, there are at least six mixing an- 
gles (nine if Mp is not symmetric) in the seesaw formula and only three 
are observable in neutrino oscillation, to get useful information on the fun- 
damental high scale theory from LFV processes, it is assumed that My, 
is diagonal so that one has a direct correlation between the observed neu- 
trino mixings and the fundamental high scale paramters of the theory. The 
important point is that the flavor mixings in Y, then reflect themselves in 
the slepton mixings that lead to the LFV processes via the RGEs. 

From the point of view of the LFV analysis, there are essentially two 
classes of neutrino mass models that need to be considered: (i) the first 
class is where it is assumed that the RH neutrino mass My, is either a 
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mass term in the basic Lagrangian or arises from nonrenormalizable terms 
such as v°x°?/Mpzp, as in a class of SO(10) models; (we will such models 
Dirac type) and (ii) a second class where the Majorana mass of the right 
handed neutrino itself arises from a renormalizable Yukawa coupling e.g. 
funt A (we will call them Majorana type models). In Dirac type models, in 
principle, one could decide to have all the flavor mixing effects in the right 
handed neutrino mass matrix and keep the Y, diagonal. In that case, RGEs 
would not induce any LFV effects. However we will bar this possibility and 
consider the case where all flavor mixings are in the Y, so that RGEs can 
induce LFV effects . In Majorana type models on the other hand, there 
will always be an LFV effect, although its magnitude will depend on the 
choice of the seesaw scale (vgz ). 

Examples of class two models are models for neutrino mixings such as 
SO(10) with a 126 Higgs field“? or models with a triplet Higgs, whose vev 
is the seesaw scale. 

In both these examples, the equations that determine the extent of 
lepton flavor violation in leading order, for the case A = 0 are: 


dm. ae 

dt — 16r? 

In the Majorana case, this equation will have contributions from the renor- 

malizable f couplings that give Majorana masses to the right handed neu- 

trinos®? in the sense that generation mixing elements in Y, will be generated 

by f’s even if they were absent in the beginning. Using these equations, 

one can obtain the branching ratios for the radiative lepton flavor violating 
processes using the formula below: 


[smg(¥IY.)] (7.68) 


L= tem; (C10 mlinCr + LiRO wv litCr) FRY + hic. (7.69) 
then the Branching ratio for the decay j — 4i + y is given by the formula 


3 
ABET? Qem 


Be; > li +7) = 
J G4 


(ICL|? + |Crl?)B(; > Li+ 2v). (7.70) 


7.7.2. Renormalization group evolution of the neutrino 
mass matrix 


In the seesaw models for neutrino masses, the neutrino mass arises from 
the effective operator 


LoHLpH 


a (7.71) 


1 
O, = — qo 
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after symmetry breaking < H? >Æ 0; here L and H are the leptonic and 
weak doublets respectively. œ and @ denote the weak flavor index. The 
matrix x becomes the neutrino mass matrix after symmetry breaking i.e. 
< H? >4 0. This operator is defined at the scale M since it arises after 
the heavy field Np is integrated out. On the other hand, in conventional 
oscillation experiments, the neutrino masses and mixings being probed are 
at the weak scale. One must therefore extrapolate the operator down from 
the seesaw scale M to the weak scale Mz.°° The form of the renormalization 
group extrapolation of course depends on the details of the theory. For 
simplicity we will consider only the supersymmetric theories, where the 
only contributions come from the wave function renormalization and is 
therefore easy to calculate. The equation governing the extrapolation of 
the kag matrix is given in the case of MSSM by: 


d 1 

Sp = [393 + 6Tr(YŻYa)]e + zle(YÀYe) + (YÀYe)] (7-72) 
We note two kinds of effects on the neutrino mass matrix from the above 
formula: (i) one that is flavor independent and (ii) a part that is flavor 
specific. If we work in a basis where the charged leptons are diagonal, then 
the resulting correction to the neutrino mass matrox is given by: 


M,(Mz) = (1+8)M(MpB-L)(1+ ô) (7.73) 
where ô is a diagonal matrix with matrix elements fea ~ om e In 
more complicated theories, the corrections will be different. Let us now 
study some implications of this corrections. For this first note that in the 
MSSM, this effect can be sizable if tan 8 is large (of order 10 or bigger). 


7.7.3. Radiative magnification of neutrino mixing angles 


A major puzzle of quark lepton physics is the diverse nature of the mixing 
angles. Whereas in the quark sector the mixing angles are small, for the 
neutrinos they are large. One possible suggestion in this connection is 
that perhaps the mixing angles in both quark and lepton sectors at similar 
at some high scale; but due to renormalization effects, they may become 
magnified at low scales. It was shown in Ref. 54 that this indeed happens 
if the neutrino spectrum os degenerate. This can be seen in a simple way 
for the v, — v, sector.>4 


426 R. N. Mohapatra 


Let us start with the mass matrix in the flavor basis: 


Mg = U*MpUt 
= Co So My 0 Co — So 
E a) ( 0 we & Co ) ; ee) 


Let us examine the situation when ¢ = 0 (i.e. CP is conserved), which 
corresponds to the case when the neutrinos vı and vz are in the same CP 
eigenstate. Due to the presence of radiative corrections to mı and mz, the 
matrix Mr gets modified to 


l+da 0 l+da 0 
Ms — ( 0 1455) Me ( 0 ae ue) 


The mixing angle @ that now diagonalizes the matrix M at the low scale 
u (after radiative corrections) can be related to the old mixing angle 6 
through the following expression: 


x 1 
tan2ð = tan20 (1 + ða +s) 5, (7.76) 
where 
N= (mg = mı)C29 + 263(m1S?2 + m2C3) = 26a(m1CF + M253) 
(m2 = m1)C'26 
(7.77) 
If 


(mı — m2) C29 = 26g(1m193 + M203) — 28a (M1C + M282), (7.78) 


then \ = 0 or equivalently 6 = 7/4; i.e. maximal mixing. Given the mass 
heirarchy of the charged leptons: Mı, K Mig, we expect |da| < ||, which 
reduces (7.78) to a simpler form: 


dmCo9 
= —_,—_, 7.79 
© Gn S2 + mC? ee) 
In the case of MSSM, the radiative magnification condition can be satisfied 
provided provided 


872|Am2(A)|C26 


h (MSSM) ~ in 


(7.80) 


For Am?|simeqAm2, this condition can be satisfied for a very wide range 
of tan p. 
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It is important to emphasize that this magnification occurs only if at the 
seesaw scale the neutrino masses are nearly degenerate. A similar mech- 
anism using the right handed neutrino Yukawa couplings instead of the 
55 Here two conditions 
must be satisfied: (i) the neutrino spectrum must be nearly degenerate (i.e. 
mı ~ Mə as in Ref. 54) and (ii) there must be a hierarchy between the 
right handed neutrinos. 


charged lepton ones has been carried out recently. 


7.7.3.1. An explicit example of a neutrino mass matrix unstable 


under RGE 


In this section, we give an explicit example of a neutrino mass matrix unsta- 
ble under RGE effects. Consider the following mass matrix with degenerate 
neutrino masses and a bimaximal mixing.*© 


0 4 
v2 V2 
M = | a 47-3 (7.81) 
til 
V2 2 2 
The eigenvalues of this mass matrix are (1,—1,1) and the eigenvectors: 
1 1 
p v2 v2 
= 1 1 
+ 1 1 
v2 2 2 
After RGE to the weak scale, the mass matrix becomes 
1 1 
0 Z wll +ô) 
M, = Z 4 —4(1 +ô) (7.83) 
B +4) $(1+6) 4(1+ 26) 


It turns out that the eigenvectors of this matrix become totally different 
and are given by: 


= LL L 
v3 v2 v6 

: = : = 1 
0 —4 v3 


We thus see that the neutrino mixing pattern has become totally altered. 
although the eigenvalues are only slightly perturbed from their unperturbed 
value. 
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7.7.4. Seesaw paradigm and leptogenesis 


Finally let us comment that in models where the light neutrino mass is 
understood via the seesaw mechanism using heavy right handed neutrinos, 
there is a very simple mechanism for the generation of baryon asymmetry 
of the universe. Since the righthanded neutrino has a high mass, it de- 
cays at a high temperature which in combination with CP violation in Y, 
generates a lepton asymmetry.°’ This lepton asymmetry is converted to 
baryon asymmetry via the sphaleron effects®® above the electroweak phase 
transition temperature since sphalerons break B+L conservation. It also 
turns out that one of the necessary conditions for sufficient leptogenesis 
is that the right handed neutrinos must be heavy as is required by the 
seesaw mechanism. To see this note that one of Sakharov conditions for 
leptogenesis is that the right handed neutrino decay must be slower than 
the expansion rate of the universe at the temperature T ~ My,. The 
corresponding condition is: 


heMnp M3 

— < yg* i , 
lop ~Y Mr (e 
hk?Mpe 


This implies that My, > ENV Translating this into a reliable bound 
on the masses of the right handed neutrinos is quite model dependent since 
the Yukawa texture i.e. hg values in the above equation depends on the 
particular way to understand large mixings as well as the neutrino mass 
hierarchy. 
To proceed further, we start with the expression for lepton asymmetry 
in these scenarios:59 
2 


1 1 M; 
I = -> ON Im fy, Yt] F | = 7.86 
i 87 [Y Y; les > a Hi M? J’ l ) 


One can draw several conclusions from this expression: first using Eq. 7.65 
and 7.86, we see that, e? is independent of the low energy CP phases. This 
implies that in principle observation of CP violation in neutrino oscillation 
may not throw any light on the origin of matter. 
Second point to notice is that for hierarchical masses for RH neutrinos 
ie. Mı K Mo 3, one can write 7.86 as 
ae o 

I T) 

= Im[Y, MY; 7.87 
From this it follows that if the neutrino masses are strictly degenerate, then 
using 7.65, it is easy to see that e7 = 0. This is an interesting result although 
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this cannot strictly be used to rule out the possibility of degenerate neutrino 
masses since, it is more natural (as emphasized earlier) for a degenerate 
neutrino spectrum to arise from a type II seesaw rather than type I seesaw 
which has been used in drawing this conclusion. 

Another consequence of 7.86 is that combining 7.85 and 7.86, it is possi- 
ble to obtain a reliable lower bound on the lightest RH neutrino mass® and 
it turns out to be: My, > 10° GeV. This is bound is somewhat strength- 
ened if one further demands that there is a u — T exchange symmetry in 
the left as well as the right handed neutrino sector for which there is some 
observational indication (since 613 appears to be very small). The bound 
then becomes My, > 6 x 10° GeV.®! 

In the presence of triplet contributions to seesaw (type II), there are new 
contributions to the lepton asymmetry given by: The type II contribution 
has been calculated and is given in Refs. 45 and 62 to be 


en Ved SE em 2 
pir e Se ley (: + i) (7.88) 
ST YY Mi Mr 


where u = AMr and A is the coupling between triplet and two doublets 
in the superpotential and f is the triplet coupling to leptons. Thus baryo- 
genesis via leptogenesis is a very sensitive way to probe the neutrino mass 
mechanisms. For more details on this see the lectures by M. C. Chen. 


7.8. Conclusions and outlook 


In summary, the neutrino oscillation experiments have provided the first 
evidence for new physics beyond the standard model. The field of neu- 
trino physics, along with the search for the origin of mass, dark matter has 
therefore become central to the study of new physics at the TeV scale and 
beyond. Another area which is foremost in the minds of many theorists 
is supersymmetry which stabilizes the Higgs mass, provides a way to un- 
derstand the electro-weak symmetry breaking and possibly a dark matter 
candidate. In discussing consequences of seesaw mechanism as well as in 
seeking theories of neutrino mass, we have assumed supersymmetry. An 
exception is the last section, where we consider low scale extra dimensional 
models for understanding Higgs mass and its possibility as an alternative 
to seesaw mechanism. 

What have we learned so far? One thing that seems very clear is that 
there is probably a set of three right handed neutrinos which restore quark 
lepton symmetry to physics; secondly there must be a local B— L symmetry 
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at some high scale beyond the standard model that keeps the RH neutrinos 
so far below the Planck scale. While there are very appealing arguments 
that the scale of B — L symmetry is close to 1014-1016 GeV’s, in models 
with extra dimensions, one cannot rule out the possibility that it is around 
a few TeVs, although the present TeV scale models generally require many 
near TeV particles with sometimes undesirable consequences for flavor vio- 
lation. Third thing that one may suspect is that the right handed neutrino 
spectrum may be split into a heavier one and two others which are nearby. 
If this suspicion is confirmed, that would point towards an SU(2) q hori- 
zontal symmetry or perhaps even an SU(3)# symmetry which breaks into 
an SU(2)y symmetry (although simple anomaly considerations prefer the 
first alternative). 

The correct theory should explain: 

(i) Why both the solar mixing angle is large and atmospheric mixing 
angle maximal? 

(ii) Why the Am2, < Am and what is responsible for the smallness 
of Ue3? While in the inverted hierarchy models and models with u — T 
symmetry, the smallness of U-3 is natural, in general it is not. In fact high 
precision search for Ue3 may hold the clue to possible family symmetries 
vrs simple grad unification. 

(iii) What is the nature of CP phases in the lepton sector and what is 
their relation to the CP phases possibly responsible for baryogenesis via 
leptogenesis? 

(iv) What is the complete mass spectrum for neutrinos? 

These and other questions are likely to prove to be very exciting chal- 
lenges to both theory and experiment in neutrino physics for the next two 
decades. These lectures are meant to be a very cursory overview of what 
seems to be the simplest way to understand neutrino masses and mixings 
i.e. seesaw mechanism and some related physics. Even at that, only a few 
selected topics are covered and for more details and references, we refer the 
reader to other excellent reviews in the literature. 
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Chapter 8 


Searching for the Higgs Boson 


D. Rainwater 


Dept. of Physics and Astronomy, University of Rochester, 
Rochester, NY, USA 


These lectures on Higgs boson collider searches were presented at TASI 
2006. I first review the Standard Model searches: what LEP did, 
prospects for Tevatron searches, the program planned for LHC, and some 
of the possibilities at a future ILC. I then cover in-depth what comes 
after a candidate discovery at LHC: the various measurements one has 
to make to determine exactly what the Higgs sector is. Finally, I discuss 
the MSSM extension to the Higgs sector. 


8.1. Introduction 


Despite all the remarkable progress made early in the 21% century for- 
mulating possible explanations for the weakness of gravity relative to 
the other forces, the nature of dark matter (and dark energy), what 
drove cosmological inflation, why neutrino masses are so small, and what 
might unify the gauge forces, we still have not yet answered the suppos- 
edly more readily accessible problem of electroweak symmetry breaking. 
Just what, exactly, gives mass to the weak gauge bosons and the known 
fermions? Is it weakly-coupled and spontaneous, involving fundamental 
scalars, or strongly-coupled, involving composite scalars? Is the flavor prob- 
lem linked? Do we discover the physics behind dark matter (and its mass), 
gauge unification and flavor at the same time? Or are those disconnected 
problems? 

Our starting point is unitarity, the conservation of probability: the weak 
interaction of the Standard Model (SM) of particle physics violates it at 
about 1 TeV [1]. The theory demands at least one new propagating scalar 
state with gauge coupling to weak bosons to keep this under control. The 
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same problem holds for fermion—boson interactions [2-5], only at much 
higher energy, so is generally less often discussed.* While the variety of 
explanations for electroweak symmetry breaking (EWSB) is vast, what 
we call the Standard Model (SM) assumes the existence of a single fun- 
damental scalar field which spontaneously acquires a vacuum expectation 
value to generate all fermion and boson masses. It is a remarkably com- 
pact and elegant explanation, simple in the extreme. Yet while it tidies 
up the immediate necessities of the SM, it suffers from glaring theoretical 
pathologies that drive much of the model-building behind more ambitious 
explanations. 

Numerous lectures and review articles already exist, covering the SM 
Higgs sector and the minimal supersymmetric (MSSM) extension [6-9], 
which are useful both for learning nitty-gritty theoretical details and serv- 
ing as formulae references. These lectures are instead a crash-course tour 
of theory in practical application: previous, present and planned Higgs 
searches, what happens after a candidate Higgs discovery, and an overview 
of MSSM Higgs phenomenology as a perturbation of that for SM Higgs. 
They are not comprehensive, but do provide a solid grounding in the basics 
of Higgs hunting. They should be read only after one has become intimate 
with the SM Higgs sector and its underlying theoretical issues. Within 
TASI 2006, this means you should already have studied Sally Dawson’s lec- 
tures. After both of these you should also be able to explain to your friends 
how we look for a Higgs boson at colliders (if they care), how to confirm 
it’s a Higgs and figure out what variety it is (since we care), and describe 
how some basic extensions to the SM Higgs sector behave as a function of 
their parameter space (nature might not care for the SM). 

Herein Ill assume that nature prefers fundamental scalars and spon- 
taneous symmetry breaking. This is a strong bias, but one that provides 
a solid framework for phenomenology. The ambitious student who wants 
to really learn all the varieties of EWSB should also study strong dynam- 
ics [16], dimensional deconstruction [17], extra-dimensional Higgsless con- 
structions [18] and the Little Higgs [19] and Twin Higgs mechanisms [20]. 
In many of these classes of theories the Higgs sector appears to be very 
SM-like, but in some no Higgs appears and one instead would pay great 
attention to weak boson scattering around a TeV. 


aThe original study [2] was clearly incorrect, but the correct line of reasoning is a work 
in progress [3-5]. 
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8.2. Collider searches for the Standard Model Higgs 


Even though the SM Higgs sector doesn’t explain flavor (why all the fermion 
masses are scattered about over 12 orders of magnitude in energy) and has 
a disconcerting radiative stability problem that surely must involve new 
physics beyond the SM, it’s a suitable jumping-off point for formulating 
Higgs phenomenology. That is, the study of physical phenomena associated 
with a theory, exploring the connection between theory and experiment. 
Without this connection, experiments would not make sense and theory 
would flail about, untested. To survey SM Higgs collider physics we need 
to recall a few fundamentals about the SM Higgs boson. 


1. The Higgs boson unitarizes weak boson scattering, VV — VV, so 
its interaction with weak bosons is very strictly defined to be the 
electroweak gauge coupling times the vacuum expectation value 
(vev); i.e., proportional to the weak boson masses. 

2. The Higgs also unitarizes VV — ff scattering, so its fermion cou- 
plings (except 1;) are proportional to the fermion mass, with a 
strictly defined universal coefficient. 

3. Because of the coupling strengths, the Higgs is dominantly pro- 
duced by or in association with massive particles (including loop- 
induced processes, as we’ll see in Sec. 8.2.1.1), and prefers to decay 
to the most massive particles kinematically allowed. 

4. The Higgs boson mass itself is a free parameter”, but influences EW 
observables, so we can fit EW precision data to make a prediction 
for its mass. 


We may thus define the SM Higgs sector by its vacuum expectation value, v, 
measured via My, Gr, etc., and the known electroweak gauge couplings; 9 
Yukawa couplings (fermion mass parameters, ignoring neutrinos and CKM 
mixing angles); and one free parameter, My. 

Prior to the Large Electron Positron (LEP) collider era starting around 
1990, Higgs searches involved looking for resonances amongst the low energy 
hadronic spectra in e*e~ collisions. These were in fact non-trivial searches, 
mostly involving decays of hadrons to Higgs plus a photon, but are generally 
regarded as comprehensive and set a lower mass bound of My = 3 GeV. 

Higgs hunting in the 1990s was owned by LEP, an ete~ collider at 
CERN which steadily marched up in energy over the decade. It found no 


bWe know it is not massless, due to the absence of additional long-range forces. 
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Higgs bosons®. Attention then turned to the long-delayed Tevatron Run II 
program, proton—antiproton collisions at 2 TeV, which got off to a shaky 
start but is now performing splendidly. It so far sees nothing Higgs-like, 
either, but has not yet gathered enough data to be able to say much. The 
proton-proton Large Hadron Collider (LHC) at CERN is also many years 
behind schedule, but its construction is now nearing completion and we 
may expect physics data within a few years. 

Our survey begins with LEP from a historical perspective and some gen- 
eral statements about Higgs boson behavior as a function of its mass. Next 
we turn our attention to the ongoing Tev2 search, for which the prospects 
hinge critically on machine performance. Then we delve into the intricacies 
of LHC Higgs pheno, which is far more complicated than either LEP or 
Tevatron, yet essentially guarantees an answer to our burning questions. 


8.2.1. The LEP Higgs search 


An obvious question to ask is, can we produce the Higgs directly in ete7 
collisions? We could then probe Higgs masses up to our machine energy, 
which for LEP-II eventually reached 209 GeV. Recalling that the Higgs- 
electron coupling is proportional to the electron mass, which is quite a 
bit smaller than the electroweak vev of 246 GeV, the coupling strength is 
about 1.5 x 1076, or teeny-tiny in technical parlance. A quick calculation 
reveals that it would take about 4 years running full-tilt to produce just 
one Higgs boson. This one event would have to be distinguished from the 
general scattering cross section to fermion pairs in the SM, which is beyond 
hopeless. 

Instead, we think of what process involves something massive, with 
vastly larger Higgs coupling, so that the interaction rate is large enough 
to produce a statistically useful number of Higgs bosons. The two obvious 
possibilities are ete~ — W*W~H (two W’s required for charge conserva- 
tion) and ete” — ZH. The first process will obviously have less reach in 
My as the two W bosons require far more energy than a single Z boson 
to produce. LEP Higgs searches therefore focused on the latter process, 
shown as a Feynman diagram in Fig. 8.1: the electron and positron anni- 
hilate to form a virtual Z, far above its mass shell, which returns on-shell 
by spitting off a Higgs boson. This process is generically known as Hig- 
gsstrahlung, analogous to bremsstrahlung radiation. Both the Higgs and 


“This may be a somewhat controversial statement, depending on what lunch table you’re 
sitting at. See Sec. 8.2.1.3. 
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Fig. 8.1. Feynman diagram for the process ete~ — ZH with subsequent Higgs and 
Z boson decays to fermion pairs. All LEP Higgs searches were based primarily on this 
process, with various fermion combinations in the final state composing the different 
search channels. 


Z immediately decay to an asymptotic final state of SM particles. For the 
Higgs this is preferentially to the most massive kinematically-allowed pair, 
while Z decays are governed by the fermion gauge couplings. In brief, the 
Z decays 70% of the time to jets, 20% of the time invisibly (to neutrinos, 
which the detectors can’t see), and about 10% to charged leptons, which 
are the most distinctive, “clean” objects in a detector. 


8.2.1.1. Momentary diversion: Higgs decays 


What, precisely, are the Higgs branching ratios (BRs)? To find these, we 
first need the Higgs partial widths; that is, the inverse decay rates to each 
final state kinematically allowed. Everyone should calculate these once as 
an exercise. 

Let’s start with the easiest case: Higgs decay to fermion pairs, which is 
a very simple matrix element. The general result at tree-level is: 


N: Gr m? My 4m? 

& f 3 f 
Tep = ————"_ where 8 = 4/1 — —> 8.1 
ff 4/2 G B M2, ( ) 


One factor of the fermion velocity 8 comes from the matrix element and 
two factors come from the phase space. I emphasize that this is at tree-level 
because there are significant QCD corrections to colored fermions. The bulk 
of these corrections are absorbed into a running mass (see Ref. [8]). For 


dSee the PDG [21] for Z boson branching ratios, which you should memorize. 
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calculations we should always use m,(Mz), the quark mass renormalized 
to the Higgs mass scale, rather than the quark pole mass. Programs such as 
HDECAY [22] will calculate these automatically given SM parameter inputs, 
greatly simplifying practical phenomenology. 

Note that the partial width to fermions is linear in My, modulo the 
cubic fermion velocity dependence, which steepens the ascent with My near 
threshold. Partial widths for various Higgs decays are shown in Fig. 8.2. 
While the total Higgs width above fermion thresholds grows with Higgs 
mass, Higgs total widths below W pair threshold are on the order of tens 
of MeV — quite narrow. The only complicated partial width to fermions 
is that for top quarks, for which we must treat the fermions as virtual (at 
least near threshold) and use the matrix elements to the full six-fermion 
final state, integrated over phase space. This is slightly more complicated, 
but easily performed numerically. 

Before the decay to top quarks is kinematically allowed, however, the 
decays to weak bosons turn on. A few W/Z widths above threshold 
the W and Z may be treated as on-shell asymptotic final states, making 
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Fig. 8.2. Select Standard Model Higgs boson partial widths, as a function of mass, MH. 
Individual partial widths are labeled, while the total width (sum of all partial widths, 
some minor ones not shown) is the black curve. Widths calculated with HDECAY [22]. 
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the partial width calculation easier. We find: 


GrM3 3 ge 

Tyy = A Syb (1 — ty + Za?) where 4 8 = y1 -— zy (8.2) 
16/27 4 ee 
ty = Te 


The factor of @ comes from phase space, while the matrix elements give the 
more complicated function of zy. The partial width is dominantly cubic 
in My, although the factors of beta and xy enhance this somewhat near 
threshold, as in the fermion case. We can see this in Fig. 8.2: the partial 
widths to VV gradually flatten out to cubic behavior above threshold. The 
reason for this stronger My dependence compared to fermions is that a 
longitudinal massive boson wavefunction is proportional to its energy in 
the high-energy limit, which enhances the coupling by a factor E/Mv. 
(Recall that it is this property of massive gauge bosons that requires the 
Higgs, lest their scattering amplitude rise as E?/M?,, violating unitarity. 
The Higgs in fact generates the longitudinal modes.) This much stronger 
dependence on My leads to a very rapid total width growth with My, 
which reaches 1 GeV around My = 190 GeV. We’ll return to this when 
discussing Higgs couplings measurements in Sec. 8.3.3. The bottom line 
is that bosons “win” compared to fermions. Thus, even though the top 
quark has a larger mass than W or Z, it cannot compete for partial width 
and thus BR. Note that the partial widths to VV are non-trivial below 
threshold: the W and Z are unstable and therefore have finite widths; they 
may be produced off-shell. The Higgs can decay to these virtual states 
because its coupling is proportional to the daughter pole masses (or, in 
the case of quarks, the running masses), not the virtual q?, which can be 
much smaller. Below threshold the analytical expressions are known [23] 
(see Ref. [7] for a summary), but are not particularly insightful to derive 
as an exercise. 

The astute reader will have noticed by now that Fig. 8.2 contains curves 
for Higgs partial widths to massless final states! (Have another look if you 
didn’t notice.) We know the Higgs couples to particles proportional to 
their masses, so this requires some explanation. Recall that loop-induced 
transitions can occur at higher orders in perturbation theory. Such interac- 
tions typically are important to calculate only when a tree-level interaction 
doesn’t exist. They are responsible for rare decays of various mesons, for 
instance, and are in some cases sensitive to new physics which may appear 
in the loop. Here, we consider only SM particles in the loop. Which ones 


442 D. Rainwater 


g 
|: aie Åt, b 
g 
Fig. 8.3. Feynman diagram for the loop-induced process H — gg in the SM. All 


quarks enter the loop, but contribute according to their Yukawa coupling squared (mass 
squared). In the SM, only the top quark is important. 


are important? Recall also once again that the Higgs boson couples pro- 

portional to particle mass. Thus, the top quark and EW gauge bosons are 

most important. For H — gg, then, that means only the top quark, while 

for H — yy it is both the top quark and W loops (there is no Z Zy vertex). 

The H — gg expression (for the Feynman diagram of Fig. 8.3) is [24]: 
a2Gr M3, 7 


Pps Erra 2 Ti [1 + (1 = Ti) f (ri)] (8.3) 


[sint VIT rT>i1 


—ł} [In it+vi-zt 


pe (8.4) 
L T in] T<1 


ith a aaa 
wi Ty = VA and f(T) = 


which is for a general quark in the loop with SM Yukawa coupling. It’s easy 
to see that in the SM the b quark contribution, which is second in size to that 
of the top quark, is inconsequential. Remember to use the running mass 
my(Mz) to take into account the largest QCD effects. When you derive 
this expression yourself as an exercise, take care to solve the loop integral 
in d > 4 dimensions, otherwise you miss a finite piece. The H — yy, Zy 
expressions have a similar form [25], but with two loop functions, since it 
can also be mediated a W boson loop (which interferes destructively with 
the top quark loop!): 


_ œ@GrM} á 


Lay = SS 8.5 
TY 128,/2 73 ( ) 


So Nc Qi Fi 


Fy = 2+37[1+(2—-7)f(7)], Fıy2 = —27[1+(1-7)f(7)], Fo = t[1-rf(7)] 

(8.6) 
where Ne; is the number of colors, Q; the charge, and F}; the particle’s 
spin. 
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Now look again more closely at Fig. 8.2. The important feature to no- 
tice is that these loop-induced partial widths are ostensibly proportional 
to M}, like the decays to gauge bosons. However, the contents of the 
brackets, specifically the f(7;) function, can alter this in non-obvious ways. 
For H — gg, Fig. 8.2 shows a slightly more than cubic dependence at low 
masses, leveling of to approximately M}, and flattening out to approxi- 
mately quadratic a bit above the top quark pair threshold. We see from 
Eq. 8.3 that the functional form changes at that threshold, albeit fairly 
smoothly, by picking up a constant imaginary piece when the top quarks 
in the loop can be on-shell. 

The partial widths to yy and Zy behave very differently than gg. For 
My below W pair threshold, the interference between top quark and W 
loops produces an extremely sharp rise with My, which transitions to some- 
thing slightly more than linear in My at W pair threshold where the W 
bosons in the loop go on-shell. There is is a smoother transition at the top 
quark pair threshold, where they can similarly go on-shell. The yy and 
Zy partial widths behave differently because of the different tty and ttZ 
couplings: the partial width to Zy at large My is almost a constant, but 
falls off for yy almost inverse cubic in My. 

Once we’ve calculated all the various possible partial widths, we sum 
them up to find the Higgs total width. Each BR is then simply the ratio 
T;/Ttot- These are shown in Fig. 8.4; note the log scale. If it wasn’t obvi- 
ous from the partial width discussion, it should be now: near thresholds, 
properly including finite width effects can be very important to get the 
BRs correct. Observe how the BR to WW* (at least one W is necessar- 
ily off-shell) is 50% at My = 140 GeV, 20 GeV below W pair threshold. 
BR(H — bb)~BR(H — WWT) at My = 136 GeV. 


8.2.1.2. A brief word on statistics — the simple view 


Now that we understand the basics of Higgs decay, and production in 
electron-positron collisions, we should take a moment to consider statis- 
tics. The reason we must resort to statistics is that particle detectors are 
imperfect instruments. It is impossible to precisely measure the energy of all 
outgoing particles in every collision. The calorimeters are sampling devices, 
which means they don’t capture all the energy; rather they’re calibrated to 
give an accurate central value at large statistics, with some Gaussian uncer- 
tainty about the mean for any single event. Excess energy can also appear, 
due to cosmic rays, beam—gas or beam secondary interactions. Quark final 
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Fig. 8.4. Select Standard Model Higgs boson branching ratios as a function of mass, 
My [22]. The Higgs prefers to decay to the most massive possible final state. The ratio 
of fermionic branching ratios are proportional to fermion masses squared, modulo color 
factors and radiative corrections. 


states hadronize, resulting in the true final state in the detector (a jet) 
being far more complicated and difficult even to identify uniquely. The 
electronics can suffer hiccups, and software always has bugs, leading to im- 
perfect analysis. Thus, we would never see two or three events at precisely 
the Higgs mass of, say, 122.6288... GeV, and pop the champagne. Rather, 
we'll get a distribution of masses and have to identify the central value and 
its associated uncertainty. 

In any experiment, event counts are quantum rolls of the dice. For a 
sufficient number of events, they also follow a Gaussian distribution about 
the true mean: 


f(z; u, 0) = = or( Se) (8.7) 


The statistical uncertainty in the rate then goes as 1/V N, where N is the 
number of events. This is “one sigma” of uncertainty: 68.2% of identically- 
conducted experiments would obtain N within o ~% +N about L = Nerue, 
representing the true cross section. Figure 8.5 shows the fractional prob- 


Searching for the Higgs Boson 445 


Fig. 8.5. Gaussian distribution about a mean u, showing the fractional probability of 
events within one, two and three standard deviations of the mean. 


abilities for various “sigma”, or number of standard deviations from the 
true mean. To claim observation of a signal deviating from our expected 
background, we generally use a 5o criteria for discovery. This means, if 
systematic errors have been properly accounted for, that there is only a 
0.00006% chance that the signal is due to a statistical fluctuation. How- 
ever, this threshold is subjective, and you will often hear colleagues take 
4o or even 30 deviations seriously. Since particle physics has seen dozens 
of three sigma deviations come and go over the decades, I would encourage 


“pay close attention 


you to regard 30 as “getting interesting”, and 40 as 
and ask lots of questions about systematics” . 

Because SM processes can produce the same final state as any ZH com- 
bined BR, we must know accurately what the background rate is for each 
signal channel (final state) and how it is distributed in invariant mass, then 
look for a statistically significant fluctuation from the expected background 
over a fixed window region. The size of the window is determined by de- 
tector resolution: the better the detector, the narrower the window, so the 
smaller the background, yielding a better signal-to-background rate. Gen- 
erally, the window is adjusted to accept one or two standard deviations of 
the hypothesized signal (68-95%). 

Analyses are then defined by two different Gaussians: that governing 
how many signal (and background) events were produced, and that param- 
eterizing the detector’s measurement abilities. The event count N in our 
above expression is the actual number of events observed, in an experiment. 
But in performing calculations ahead of time for expected signal and back- 
ground, it is variously taken as just B, the number of background events 
expected, or S + B, expected signal included, depending on the relative 
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sizes of S and B. For doing phenomenology, trying to decide which signals 
to study and calculate more precisely, the distinction is often ignored. 

The statistical picture I’ve outlined here is quite simplified. Not all 
experiments have sufficient numbers of events to describe their data by 
Gaussians — Poisson statistics may be more appropriate. (An excellent text 
on statistics for HEP is Ref. [26].) Not all detector effects are Gaussian- 
distributed. Nevertheless, it gets across the main point: multiple sources of 
randomness introduce a level of uncertainty that must be parameterized by 
statistics. Only when the probability of a random background fluctuation 
up or down to the observed number of events is small enough, perhaps in 
some distribution, can signal observation be claimed. Exactly where this 
line lies is admittedly a little hazy, but there’s certainly a point of several 
sigmas at which everybody would agree. 


8.2.1.3. LEP Higgs data and results 


Now to the actual LEP search. Electrons and positrons have only elec- 
troweak interactions, so backgrounds and a potential Higgs signal are qual- 
itatively of the same size. (We’ll see shortly in Sec. 8.2.2 how this is not so 
at a hadron collider, which has colored initial states.) LEP thus had the 
ability to examine almost all Z and H decay combinations: bbjj, bbl+é-, 
bbvi, rtr jj, jjjj, etc. The largest of these is bbjj, as it combines the 
largest BRs of both the Z and H. It’s closely followed by bbvi, since a Z 
will go to neutrinos 20% of the time. Neutrinos are missing energy, however, 
so not precisely measured, making it possible that any observed missing en- 
ergy didn’t in fact come from a Z. Jets are much less well-measured than 
leptons, so a narrower mass window can be used for the Z in bbl t47 events 
than bbjj; the smaller backgrounds in the narrower window might beat the 
smaller statistics of the leptonic final state. 

The exact details of each LEP search channel are not so important, 
as lack of observation means we’re more interested in channels’ signal and 
background attributes at hadron colliders. For these lectures I just present 
the final LEP result combining all four experiments. The interested student 
should read Eilam Gross’ “Higgs Statistics for Pedestrians” , which goes into 
much more depth, and with wonderful clarity [27]. 

The money plot is shown in Fig. 8.6. It shows the expected confidence 
level (CL) for the signal+background hypothesis as a function of Higgs 
mass. The thin solid horizontal line at CL=0.05 signifies a 5% probability 
that a true signal together with the background would have fluctuated down 
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Fig. 8.6. Four-experiment combined result of the LEP Standard Model Higgs search. 
No signal was observed, establishing a lower limit of 114.4 GeV. See text of Ref. [27] for 
explanation. 


in number of events to not be discriminated from the expected background. 
The green and yellow regions are the 1o and 20 expected uncertainty bands 
as a function of My, taking into account all sources of uncertainty, calcu- 
lational as well as detector effects. Where the central value (dashed curve) 
crosses 0.05 defines the 95% CL expected exclusion (lower mass limit). This 
is essentially the available collision energy minus the Z mass minus a few 
extra GeV to account for the Z finite width — it may be produced slightly 
off-shell with some usable rate. The solid red curve is the actual experi- 
mental result, which is slightly above the experimental result everywhere, 
meaning that the experiments gathered a couple more events than expected 
in the 115-116 GeV mass bin. 

The end of LEP running involved a certain amount of histrionics. At 
first, the number of excess event at the kinematic machine limit was a 
few, but more careful analyses removed most of these. For example, one 
particularly notorious event originally included in one experiment’s analysis 
had more energy than the beam delivered. Another experiment removed 
a candidate event because some of the outgoing particles traveled down a 
poorly-instrumented region of the detector which was not normally used in 
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Fig. 8.7. Event display of an interesting candidate Z — jj, H — bb event in the Aleph 
detector at the end of LEP-II running and at the machine’s kinematic limit [28]. 


analysis. The final, most credible enumeration was one candidate event in 
one experiment, show in Fig. 8.7. 


8.2.2. Prospects at Tevatron 


With the end of the LEP era, all eyes turned to Run II of the upgraded Fer- 
milab Tevatron. Its energy increased from 1.8 to 1.96 GeV, and is expected 
to gather many tens of times the amount of data in Run I. Higgs-hunting 
hopes were high [29], although it was clear that the machine and both 
detectors have to perform exceptionally well to have a chance, as Teva- 
tron’s Higgs mass reach will not be all that great, and will have significant 
observability gaps in the mass region expected from precision EW data. 
To understand the details and issues, we first need to identify how a 
Higgs boson may be produced in proton-antiproton collisions. Like the 
electron, the light quarks have too small a mass (Yukawa coupling) to pro- 
duce a Higgs directly with any useful rate, discernible against the large 
QCD backgrounds produced in hadron collisions.® Quarks may annihilate, 
however, to EW gauge bosons, which have large coupling to the Higgs; and 


For example, H — bb is the dominant BR of a light Higgs, but QCD b jet pair production 
in hadron collisions is many orders of magnitude larger. Cf. Fig. 8.10. 
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Fig. 8.8. Feynman diagrams for the four dominant Higgs production processes at a 
hadron collider. 


likewise to a top quark pair. Incoming quarks may also emit a pair of gauge 
bosons which fuse to form a Higgs, a process known as weak boson fusion 
(WBF). But high energy protons also possess a large gluon content; recall 
that gluons have a loop-induced coupling to the Higgs. Figure 8.8 displays 
Feynman diagrams for all four of these processes at hadron colliders. The 
questions are, what are their relative sizes, and what are their backgrounds? 
Because of the partonic nature of hadron collisions, the Higgs couplings are 
not enough to tell us the relative sizes; we also need to take into account 
incoming parton fluxes and final state phase space — single Higgs produc- 
tion is much less greedy than ttH associated production, for instance. In 
addition, the internal propagator structure of the processes is important: 
W H,ZH bremsstrahlung are s-channel suppressed, but no other process is. 

The various rates, updated in 2006 with the latest theoretical calcula- 
tions [30, 31], are shown in Fig. 8.9 for a light SM Higgs boson. Students 
not already familiar with hadron collider Higgs physics will probably be 
surprised to learn that gg — H, gluon fusion Higgs production, dominates 
at Tevatron energy. This is partly because the coupling is actually not 
all that small, partly because high-energy protons contain a plethora of 
gluons, and partly because there is no propagator suppression, and much 
less phase space suppression, compared to other processes. Higgsstrahlung 
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Fig. 8.9. Cross sections for Higgs production in various channels at Tevatron Run II 
(Vs = 2 TeV). Note the log scale. Figure from the Tev4LHC Higgs working group [30]. 


(Fig. 8.8(c)) is still important at Tevatron, analogous to LEP. Note that the 
smaller cross sections have more complicated final states, therefore poten- 
tially less background, and possibly distinctive kinematic distributions that 
could assist in separating a signal from the background. It’s not obvious 
that the largest rate is the most useful channel! Considering that the Higgs 
decays predominantly to different final states as a function of its mass, it’s 
also not obvious that the optimal channel at one mass is optimal for all 
masses. In fact, that’s definitely not the case. 

Not knowing the answer, we naturally start by considering the largest 
cross section times branching ratio, gg = H — bb. Just how large is the 
background, QCD pp — bb production? Figure 8.10 shows a variety of 
SM cross section for hadron collisions of various energy, and marks off in 
particular Tevatron and LHC. (The discontinuity in some curves is because 
Tevatron is pp and LHC is pp.) We immediately notice that the bb inclusive 
rate is almost nine orders of magnitude larger than inclusive H — bb. Of 
course the background will be smaller in a finite window about the Higgs 
mass. But jets are not so well-measured, necessitating a fairly large window, 
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Fig. 8.10. Various SM “standard candle” cross sections at hadron colliders of varying 
energy, with Tevatron and LHC marked in particular. Note the log scale. Discontinuities 
are due to the difference between pp for Tevatron and pp for LHC. Figure from Ref. [32]. 
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~15-20 GeV either side of the central value. We lose only a few orders of 
magnitude of the background, taking us from “laughable” to just terminally 
hopeless. 

The general rule of thumb at hadron collider experiments is to require 
a final state with at least one high-energy lepton. This means lower back- 
grounds because the event had at least some EW component, such as a W 
or Z, or came from a massive object, such as the top quark, which is not 
produced in such great abundance due to phase space suppression. 

Tevatron’s Higgs search is rate-limited. We can see this by multiplying 
the 150 GeV Higgs cross section from Fig. 8.9 by the expected integrated 
luminosity of 4-8 fb~! during Run II. Because of this, and the very low effi- 
ciency of identifying final-state taus in a hadron collider environment (unlike 
at LEP), Tevatron’s experiments CDF and DØ focus on the H — bb final 
state where that decay dominates the BR, and Higgsstrahlung to obtain 
the lepton tag. For larger Higgs masses, where H — WtW- dominates, 
gluon fusion Higgs production is the largest rate, but Higgsstrahlung has 
some analyzing power. To summarize [29]: 


My < 140 GeV: H — bb dominates, so we use: 
- WH = &vbb 
- ZH > tt e-bb 
- WH, ZH = jjbb 
- ZH > v0bb 
My = 140 GeV: H — WtW- dominates, so we use: 


- gg > H — WtW- (dileptons) 
- WH = W*+W*tW- (2¢ and 3¢ channels) 


8.2.2.1. VH, H — bb at Tevatron 


While a lepton tag gets rid of most QCD backgrounds, it doesn’t auto- 
matically eliminate top quarks: they decay to Wb, thus the event often 
contains one lepton and two jets, or two leptons and missing energy, in 
addition to the b jet pair. This is the same final state as our Higgs signal, 
with either extra jets or transverse energy imbalance. Kinematic cuts help, 
but because the detectors are imperfect some top quark events will leak 
through. Jet mismeasurement gives fake missing energy, for example (and 
is one of the most difficult uncertainties to quantify in a hadron collider 
experiment). In addition, QCD initial-state radiation from the incoming 
partons can give extra jets. Thus top quark and Higgs signal events quali- 
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tatively become very similar. To control this further the experiments have 
to look at other observables, such as angular distributions of the b jets and 
leptons. Other backgrounds to consider are QCD Wbb production, weak 
bosons pairs where one decays to bb (and thus has invariant mass close to 
the Higgs signal window). 

Figure 8.11 shows the results of a CDF simulation study of WH and 
ZH Higgsstrahlung events at Run II for My = 115 GeV (right at the 
LEP Higgs limit) [33]. First note how the top quark pair and diboson 
backgrounds peak very close to the Higgs mass. Eyeballing the plots and 
simplistically applying our knowledge of Gaussian statistics, we could easily 
believe that this could yield a four or five sigma signal, perhaps combined 
with DØ results. However, carefully observe that the shape of the invari- 
ant mass distribution for background alone and with signal are extremely 
similar: they are both steeply falling; the Higgs signal is not a stand-out 
peak above a fairly flat background. Therein lies a hidden systematic! 
This means that we must understand the kinematic-differential shape of 
the QCD backgrounds to a very high degree of confidence. This is not 
just knowing the SM background at higher orders in QCD, differentially, 
but also the detector response. This criticality is not appreciated in most 
discussions of a potential discovery at Tevatron. It should be obvious that 
an excess in one of these channels would cause a scramble of cross-checking 
and probably further theoretical work to ensure confidence, in spite of the 
statistics alone. We’ll run into this feature again with one of the LHC 
channels in Sec. 8.2.3.1, but quantified. 

CDF has in fact already observed an interesting candidate Higgs event 
in Run II, in the first few hundred pb~!. It is in the ZH — vibb channel 
(a b jet pair plus missing transverse energy). The event display and key 
kinematic information are shown in Fig. 8.12. Given the very low b jet pair 
invariant mass, it’s much more likely that the event came from EW ZZ 
or QCD Zbb production (cf. Fig. 8.11). It therefore doesn’t generate the 
kind of excitement that the handful of events at LEP did. Nevertheless, 
finding this event was a milestone, showing that CDF could perform such 
an analysis and find Higgs-like events with good efficiency. 

Table 8.1 summarizes the 2000 Tevatron Higgs Working Group Report 
predictions for Higgsstrahlung reach in Run II [29]. The results are quoted 
for one detector and per fb~!, hence the rather small significances. CDF 
and DØ will eventually combine results, giving a factor of two in statistics. 
However, it’s not known how much data they’ll eventually collect by 2009 
or 2010, when LHC is expected to have first physics results and CDF & DØ 
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Fig. 8.11. CDF simulations of a 115 GeV signal at Tevatron Run II in ZH (left) and 


WH (right) Higgsstrahlung production with Higgs decays H — bb and assuming 10 fb~! 
is collected [33]. 


detector degradation becomes an issue. Fairly low Higgs masses are shown, 
because when the report was written nobody expected LEP to perform 
as well as it did, greatly exceeding its anticipated search reach. It should 
be obvious that a clear discovery would require a large amount of data, 
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Fig. 8.12. Interesting bbpr event at CDF in Tevatron Run II [34]. 


Table 8.1. Predicted signal significances at Tevatron Run II, 
for one detector and 1 fb~!, for various V H, H — bb searches, 
taken from Ref. [29]. 


Higgs Mass (GeV/c?) 


Channel Rate 90 100 110 120 130 
3 S 8.7 9.0 48 44 37 

lE vbb B 28 39 19 26 46 
S/VB 16 1.4 1.1 0.9 0.5 

. S 12 8 63 47 39 
virbb B 123 70 55 45 47 
S/VB 11 1.0 08 07 06 

= S I2 09 08 08 06 
+ib B 2.9 1.9 2.3 2.8 1.9 
S/VB 0.7 0.7 0.5 0.5 0.4 

_ S 8.1 5.6 3.5 2.5 1.3 
qqbb B 6800 3600 2800 2300 2000 
S/VB 0.10 0.09 0.07 0.05 0.03 


combining multiple channels, and the Higgs boson happening to be fairly 
light; not to mention the QCD shape systematic concern I described earlier 
(but is not quantified). In spite of this apparent pessimism, however, CDF 
and DØ seem to be performing modestly better than expected — higher 
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efficiencies for b tagging and phase space coverage, better jet resolution, 
etc. There is as yet no detailed updated report with tables such as this, 
but there are some newer graphically-presented expectations IIl show as a 
summary. 


8.2.2.2. gg — H — WW- at Tevatron 


For My 2 140 GeV, a SM Higgs will decay mostly to W pairs (cf. Fig. 8.4), 
which has a decent rate to dileptons and has very little SM background — 
essentially just EW W pair production, with some background from top 
quark pairs where both b jets are lost. This channel has some special 
characteristics due to how the Higgs decay proceeds. There is a marked 
angular correlation between the outgoing leptons which differs from the 
SM backgrounds: they prefer to be emitted together, that is close to the 
same flight direction in the center-of-mass frame [35]. 

To understand this correlation, consider what happens if the Higgs de- 
cays to a pair of transversely-polarized W bosons. For W decays, the 
lepton angle with respect to the W* spin follows a (1 + cos p+)? distri- 
bution. That is, the positively-charged lepton prefers to be emitted with 
the W spin, while the negatively-charged lepton prefers to be emitted op- 
posite the W spin. Since the Higgs is a scalar (spin-0), the W spins are 
anti-correlated, thus the leptons are preferentially emitted in the same di- 
rection. For longitudinal W bosons, the lepton follows a sin? 4, distribution. 
The W spins are still correlated, however, and the matrix element squared 
(an excellent exercise for the student) is proportional to (pe- - py) (pe+ + pr). 
Since a charged lepton and neutrino are emitted back-to-back in the W rest 
frame, this is again maximized for the charged leptons emitted together. 
This correlation is shown visually by the schematic of Fig. 8.13. Projected 
onto the azimuthal plane (transverse to the beam), its efficacy is shown in 
Fig. 8.14 by comparison to various backgrounds [29, 36]. 
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Fig. 8.13. Diagram showing the preferred flight direction of charged leptons in H — 
l+ve-v. 
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In addition to this angular correlation, we may also construct a trans- 
verse mass (Mr) for the system, despite the fact that two neutrinos go 
missing [37]. We first write down the transverse energy (pr) of the dilepton 
and missing transverse energy (Er) systems, 


ET,- = VPE- FMga Er = 4/2 + m2, ,- (8.8) 


where I’ve substituted the dilepton invariant mass m?,,- for m2,. This is 
exact at H — WW threshold, and is a very good approximation for Higgs 
masses below about 200 GeV and where this decay mode is open. The W 
pair transverse mass is now straightforward: 


Mtww = (Er T LO pi P — (PT,.. ,- + pr)? (8.9) 
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Fig. 8.14. Dilepton azimuthal angular correlation for a H = WtW — tvg T signal 
and its backgrounds. The efficacy of the cut (vertical line) can easily be estimated 
visually. From the Tevatron Run II Higgs Working Group Report [29]. 
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This gives a nice Jacobian peak for the Higgs signal, modulo detector 
missing-transverse-energy resolution, whereas the SM backgrounds tend to 
be comparatively flat. 

Utilizing these techniques gives Tevatron some reach for a heavier Higgs 
boson, mostly in the mass range 150 S My < 180 GeV, where the BR to 
WwW is significant and the Higgs production rate is not too small. 


8.2.2.3. Tevatron Higgs summary expectations 


Tevatron Higgs physics expectations have changed since the 2000 Report, 
as DØ and CDF have better understood their detectors and made analysis 
improvements. As yet, the only progress summary is from 2003, shown 
in Fig. 8.15. It compares the original Report’s findings, shown by the 
thick curves, with improved findings for the low-mass region, shown by the 
thinner lines. However, the new results do not yet include systematic un- 
certainties, which may be considerable. We should expect some form of a 
new summary expectation sometime in 2007. A final note on the undis- 
cussed WBF production mode: some study has been done (see Sec. II.C.4 
of Ref. [29]), but DØ and CDF both lack sufficient coverage of the forward 
region to use this mode. This is not the case at LHC. 
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Fig. 8.15. Expected required integrated luminosity per experiment required in Run II 
to observe a SM Higgs as a function of My [33]. 
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Run II now has about 1 fb~! of analyzed data, and a Higgs search sum- 
mary progress report is available in Ref. [38], which updates each channel’s 
expectations. 


8.2.3. Higgs at LHC 


Higgs physics at LHC will be similar to that at Tevatron. There is the 
slight difference that LHC will be pp collisions rather than pp. The biggest 
difference, however, is the increased energy, from 2 to 14 TeV. Particle 
production in the 100 GeV mass range will be at far lower Feynman z, 
where the gluon density is much larger than the quark density. In fact, it’s 
useful (for Higgs physics) to think of the LHC as a gluon collider to first 
order. The ratio between gluon fusion Higgs production and Higgsstrahlung 
is thus larger than at Tevatron. Figure 8.16 displays the various SM Higgs 
cross sections, only over a much larger range of My — at LHC, large-My 
cross sections are not trivially small, compared to at the Tevatron. There 
are huge QCD corrections to the gg — H rate (also at Tevatron), but these 
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Fig. 8.16. Cross sections for Higgs production in various channels at LHC (ys = 
14 TeV) [30]. 
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are now known at NNLO and under control [39] (and included in Fig. 8.16). 
They don’t affect the basic phenomenology, however. Knowing that LHC 
is plans to collect several hundred fb~! of data, a quick calculation reveals 
that the LHC will truly be a Higgs factory, producing hundreds of thousands 
of light Higgs bosons, or tens of thousands if it’s heavy. 

Looking back at Fig. 8.10, we see that while the Higgs cross section rises 
quite steeply with collision energy (gg — H is basically a QCD process), so 
do important backgrounds like top quark production. The inclusive b cross 
section is still too large to access to gg > H — bb, but note that the EW 
gauge boson cross sections do not rise as swiftly with energy. Immediately 
we realize that channels like gg — H — W+W7 should have a much better 
signal-to-background (S/B) ratio. (In fact it suffers from non-trivial single- 
top quark [40] and gg > WtW- [41] backgrounds, but is still an excellent 
channel for My Z 150 GeV.) The figure does not show cross sections like 
Wb or Zbb, which grow QCD-like and thus become a terminal problem for 
WH and ZH channels. 

Obviously there are a few significant differences between Tevatron and 
LHC with implications for Higgs physics. We’ll lose access to WH and ZH 
at low mass, at least for Higgs decay to b jets. What about rare decays, since 
the production rate is large? The ttH cross section is large and would yield 
a healthy event rate. It’s complexity is distinctive, so one might speculate 
that perhaps it could be useful. WBF production is also accessible due to 
better detectors, and likewise its more complex signature is worthy of a 
look. It will in fact turn out to be perhaps the best production mode at 
LHC. 

As with Tevatron, we need to understand both the signal and back- 
ground for each Higgs channel we wish to examine. As a prelude to Chap- 
ter 8.3, Higgs measurements, at LHC we won’t want to just find the Higgs 
in one mode. Rather, we’ll want to observe it in as many production and 
decay modes as possible, to study all its properties, such as couplings. 


8.2.3.1. ttH, H — bb 


Let’s begin by discussing a very complex channel, top quark associated 
production at low mass, ttH,H — bb. This was studied early on in the 
ATLAS TDR [42] and in various obscure CMS notes, and found to be a 
sure-fire way to find a light Higgs. Figure 8.17 shows a schematic of such 
an event, with multiple b jets from both top quarks and the Higgs, at least 
one lepton from a W for triggering, and possibly extra soft jets from QCD 
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Fig. 8.17. Left: schematic of the outgoing particles in a typical ttH, H — bb event at 
LHC [47]. Right: early CMS study expectations for a bb mass peak in such events, for 
My = 115 GeV [44, 45]. 


radiation. The schematic is a bit fanciful in the neatness of separation of 
the decay products, but is useful to get an idea of what’s going on. 

These early studies [42-45] were too ambitious, however. The back- 
grounds to this signal are ttbb and ttjj' production, pure QCD processes. 
The extra (b) jets must be fairly energetic, or hard, because the signal is 
a 100+ GeV-mass object which decays to essentially massless objects. De- 
spite this being a known problem [46], these backgrounds were calculated 
using the soft/collinear approximation for extra jet emission implemented 
in standard Monte Carlo tools such as PYTHIA or HERWIG. This greatly 
underestimated the backgrounds. 

The left panel of Fig. 8.18 shows the results of a repeated study by 
ATLAS using a proper background calculation [47]. (Recent CMS studies 
found similar results, and the new CMS TDR [48] does not even bother to 
discuss this channel.) There is no longer any clearly-visible mass peak, and 
S/B is now about 1/6, much poorer. While the figure reflects only 1/10 
of the expected total integrated luminosity at LHC, statistics is not the 
problem. Rather, it is systematic: uncertainty on the exact shape of the 
QCD backgrounds. 

Therein lies the sleeping dragon. Now is a good time to explain how 
systematic errors may enter our estimate of signal significance. Our simple 


fNon-b jets can fake b jets with a probability of about 1% or a little less. 
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Fig. 8.18. Left: results of a more up-to-date ATLAS studying of ttH, H — bb produc- 
tion at LHC, for 30 fb—! of data and a Higgs mass of My; = 120 GeV [47]. The QCD 
backgrounds were calculated with exact matrix elements rather than in the soft/collinear 
approximation. Right: maximum achievable signal significance for two LHC Higgs chan- 
nels as a function of S/B and shape systematic uncertainty A [49], as discussed in the 
text. 


formula is modified: 

S T S L= S/B 

VB VB + BA?) A 

where A is the shape uncertainty in the background, a kind of normalization 
uncertainty. In the limit of infinite data, if S/B is fixed (which it is), signal 
significance saturates. The only way around this is to perform higher-order 
calculations of the background to reduce A (and hope you understand the 
residual theoretical uncertainties). The right panel of Fig. 8.18 shows the 
spectrum of possibilities [49]. For the known 10% QCD shape systematic 
for ttH, even an infinite amount of data would never be able to grant us 
more than about a 30 significance. This could still potentially be useful for 
a coupling measurement, albeit poorly, but will not be a discovery channel 
unless higher-order QCD calculations can improve the situation. Calculat- 
ing even just ttbb at NLO is currently beyond the state of the art, but is 
likely to become feasible within a few years. 

While I don’t discuss it here, top quark associated Higgs production does 
show some promise for the rare Higgs decays to photons. Photons are very 
clean, well-measured, and the detectors have good rejection against QCD 
jet fakes. The final word probably hasn’t been written on this, but the 
CMS TDR [48] does have updated simulation results which the interested 
student may read up on. 


(8.10) 
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8.2.3.2. gg > H- vy 


We’ve just seen that QCD can be a really annoying problem for Higgs 
hunting at LHC. A logical alternative for a low-mass Higgs is to look for 
its rare decays to EW objects, e.g. photons. The BR is at about the two 
per-mille level for a light Higgs, 110 < My < 140 GeV. The LHC will 
certainly produce enough Higgses, but what are the backgrounds like? 

It turns out that the loop-induced QCD process gg — yy is a non-trivial 
contribution, but we also have to worry about single and double jet fakes 
from QCD jy and jj production. This occurs when a leading 7° from 
jet fragmentation goes to photons, depositing most of the energy in the 
EM calorimeter, thereby looking like a real photon. Fortunately, because 
photons and jets are massless, the invariant mass distribution obeys a very 
linear 1/m.,, falloff in our region of interest. The experiments can in that 
case normalize the background very precisely from the sidebands, where 
we know there is no Higgs signal. Shape systematics are not much of a 
concern, thus avoiding the pitfalls of the ttH, H — bb case. 

Figure 8.19 shows the results of an ATLAS study for this channel using 
30 fb~! of data [42], 1/10 of the LHC run program or 3 years at low- 
luminosity running. The exact expectations are still uncertain, mostly due 
to an ongoing factor of two uncertainty in the fake jet rejection efficiency. 
A conservative estimate shows that this channel isn’t likely to be the first 
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Fig. 8.19. ATLAS simulation of gg > H — yy at LHC for My = 120 GeV and 30 fb™1 
of data [42]. The right panel is the mass distribution after background subtraction, 
normalized from sidebands. 
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discovery mode, but would be crucial for measuring the Higgs mass precisely 
at low My, to about 1% [42, 48]. Photon energy calibration nonlinearity 
in the detector may be an issue for the ultimate precision, but is generally 
regarded as minor. We’ll come back to this point in Chapter 8.3 on Higgs 
property measurements. 

While I focus here on the SM, keep in mind that because H — yy is a 
rare decay, it can be very sensitive to new physics. Recall that the coupling 
is induced via both top quark and W loops which mostly cancel. Depending 
on how the new physics alters couplings, or what new particles appear 
in the loop, the partial width could be greatly suppressed or enhanced. 
(Anticipating Chapter 8.4, the interested student could peruse Ref. [51] 
and references therein to see how this can happen in supersymmetry.) 


8.2.3.3. Weak boson fusion Higgs production 


Let us explore this other production mechanism I said isn’t accessible at 
Tevatron, weak boson fusion (WBF). It was long ignored for LHC light 
Higgs phenomenology because its rate is about an order of magnitude 
smaller than gg — H there. However, it has quite distinctive kinemat- 
tics and QCD properties that make it easy to suppress backgrounds, for all 
Higgs decay channels. The process itself is described by an incoming pair 
of quark partons which brem a pair of weak gauge bosons, which fuse to 
produce a Higgs; see Fig. 8.20. 

The first distinctive characteristic of WBF® is that the quarks scatter 
with significant transverse momentum, and will show up as far forward and 
backward jets in the hadronic calorimeters of CMS and ATLAS. The Higgs 
boson is produced centrally, however, so its decay products, regardless of 
decay mode, typically show up in the central detector region. This is shown 
in the lego plot schematic in the right panel of Fig. 8.205. 

The reason for this scattering behavior comes from the W (or Z) prop- 
agator, 1/(Q? — M7). For tchannel processes, Q? is necessarily always 
negative. Thus the propagator suppresses the amplitude least when Q? is 
small. For small Q?, we have Q? = (pz — pi)? ~ E2(1— x)0?, where « is 
Some experimentalists refer to this as vector boson fusion (VBF), even though the 
vector QCD boson (gluon) process of Fig. 8.22 is not included. This will cause increasing 
confusion as time goes by. 
hThe angle ¢ is the azimuthal angle perpendicular to the beam axis. Pseudorapidity 
is a boost-invariant description the polar scattering angle, 7 = — log(tan 2), The lego 


plot is a Cartesian map of the finite-resolution detector in these coordinates, as if the 
detector had been sliced lengthwise and unrolled. 
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Fig. 8.20. WBF Higgs production Feynman diagram and lego plot schematic of a typical 
event. 


the fraction of incoming quark energy the weak boson takes with it, and is 
small. Thus 0 prefers to be small, translating into large pseudorapidity. One 
quark will be scattered in the far forward detector, the other far backward, 
and the pseudorapidity separation between them will tend to be large. We 
call these “tagging” jets. QCD processes with an extra EW object(s) which 
mimics a Higgs decay, on the other hand, have a fundamentally different 
propagator structure and prefer larger scattering angles [52, 53], including 
at NLO [54]. The differences between the two are shown in Fig. 8.21 [55]. 

The second distinctive characteristic is QCD radiation [56]. Additional 
jet activity in WBF prefers to be forward of the scattered quarks. This is 
because it occurs via bremsstrahlung off color charge, which is scattered 
at small angles, with no connection between them. In contrast, QCD pro- 
duction always involves color charge being exchanged between the incoming 
partons: acceleration through 180 degrees. QCD bremsstrahlung thus takes 
place over large angles, covering the central region. Central jet activity can 
be vetoed, giving large background suppression [57]. We won’t discuss it 
further, due to theoretical uncertainties; the interested student may learn 
more from Ref. [58]. 

We’ll see in the next few subsections that WBF Higgs channels are 
extremely powerful even without a central jet (minijet) vetoi. Eventually 


iA technical topic outside our present scope: see Refs. [53, 57-59] and the literature they 
reference. 
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Fig. 8.21. Tagging jet rapidity (left) and separation (right) for WBF Higgs production 
v. QCD tt production [55]. 
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Fig. 8.22. Representative Feynman diagrams for gluon fusion Higgs plus two jets pro- 
duction [60]. 


a veto will be used, after calibration from observing EW v. QCD Zjj 
production in the early running of LHC [53]. There is however another 
lingering theoretical uncertainty, coming from Higgs production itself! 

QCD Higgs production via loop-induced couplings may itself give rise 
to two forward tagging jets, which would then fall into the WBF Higgs 
sample [60]. Some representative Feynman diagrams for this process are 
shown in Fig. 8.22. After imposing WBF-type kinematic cuts (far for- 
ward/backward, well-separated jets, central Higgs decay products), this 
contribution to the WBF sample adds about another third for a light Higgs, 
or doubles it for a very heavy Higgs, Mz = 350 GeV, as shown in the left 
panel of Fig. 8.23. The residual QCD theoretical cross section uncertainty 
is about a factor of two, however, and being QCD it will produce far more 
central jets, which will be vetoed to reject QCD backgrounds. Naively, 
then, gluon fusion H7j is an ~ 10% contribution to WBF, but with a huge 
uncertainty. 

This contribution is a mixed blessing. It’s part of the signal, so would 
hasten discovery. Yet it creates confusion, since at some point we want 
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Fig. 8.23. Left: WBF and gluon fusion contributions to the forward-tagged Hj7 sample 
at LHC. Right: azimuthal angular distributions for the same two processes, showing 
distinctive differences. Figures taken from Ref. [60]. 


to measure couplings, and the WBF and gluon fusion components arise 
from different couplings. Fortunately, there is a difference! WBF produces 
an almost-flat distribution in ¢;;, the azimuthal tagging jet separation, but 
gluon fusion has a suppression at 90 degrees [60]; cf. right panel of Fig. 8.23. 


8.2.3.4. Weak boson fusion H > t*7~ 


Now we know that the WBF signature can strongly suppress QCD back- 
grounds because of its unique kinematic characteristics. We expect that 
H — yy is visible in WBF [48, 61, 62], but being a rare decay in a smaller- 
rate channel, it’s not expected to lead to discovery. Rather, it would be 
a useful additional channel for couplings measurements. Let’s now instead 
discuss a decay mode we haven’t yet considered, H — 7*+7~. This is sub- 
dominant to H — bb in the light Higgs region, My < 150 GeV, but the 
backgrounds are more EW than QCD. We thus have some hope to see it, 
whereas H —> bb remains frustratingly hopeless. 
We first have to realize that taus decay to a variety of final states: 


- 35% T — lvev,, ID efficiency ee ~ 90% 

- 50% T — hiv, “1-prong” hadronic (one charged track), ID effi- 
ciency €n ~ 25% 

- 15% T — hav, “3-prong” hadronic (three charged tracks), which 


are thrown away 


The obvious problem is that with at least two neutrinos escaping, the Higgs 
cannot be reconstructed from its decay products. Or can it? 
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Let’s assume the taus decay collinearly. This is an excellent approxima- 
tion: since 50+ GeV energy taus have far more energy than their mass, so 
their decay products are highly collimated. We then have two unknowns, 
z+ and x_, the fractions of tau energy that the charged particles take with 
them. What experiment measures is missing transverse energy in the x 
and y directions. Two unknowns with two measurements is exactly solv- 
able. For our system this gives [63]: 

2 Mire- 


2,- = E 42m? (8.11) 


m T 
TE 


(an excellent exercise for all students to get a grip on kinematics and useful 
tricks at hadron colliders). An important note is that this doesn’t work for 
back-to-back taus (the derivation will reveal why), but WBF Higgses are 
typically kicked out with about 100 GeV of pr, so this almost never happens 
in WBF. This trick can’t be used in the bulk of gg — H events because 
there it is produced mostly at rest with nearly all taus back-to-back. 

We need a lepton trigger, so consider two channels: rr — +h and 
Tr — L+- (L= e, u). The main backgrounds are EW and QCD Zjj 
production (really Z/y*), top quark pairs, EW & QCD WWjj and QCD 
bbjj production. But after reconstruction, the non-Z backgrounds look 
very different than the signal in x}-x— space, as shown in Fig. 8.24. 

ATLAS and CMS have both studied these channels with full detec- 
tor simulation and WBF kinematic cuts, but no minijet veto, and found 
extremely promising results [55]. Figure 8.25 shows invariant mass dis- 
tributions for a reconstructed Higgs in the two different decay channels, 
assuming only 30 fb~! of data. The Higgs peak is easily seen above the 
backgrounds and away from the Z pole. Mass resolution is expected to be 
a few GeV. 

But this joint study by CMS and ATLAS [55] is not the best we can 
do. The joint study ignored the minijet veto, for instance. While that will 
assuredly improve the situation further, we’re just not sure precisely how 
much. Putting this aside for the moment, there are yet further tricks to 
play to improve the situation. 


The leading idea zeroes in on the fact that missing transverse momen- 
tum (pr) has some uncertainty due to jet energy mismeasurement (those 
imperfect detectors). Using a x? test, one determines which is more likely: 
Z = Ttr or H => rt 
amining the schematics in Fig. 8.26, we see this is tantamount to deciding 


which fit is closer to the center of the pr uncertainty region. Early indica- 


T`, using a fixed Higgs mass constraint [65]. Ex- 
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tions are that this technique would improve S/B by about a factor four, in 
addition to recovering some signal lost using more traditional strict kine- 
matic cuts on 2} and x— (recall Fig. 8.24). This would approximately halve 


Fig. 8.24. Reconstructed x+ v. x— (#1, £2) for a WBF H — rtr” signal v. non-Z 
backgrounds [64]. 
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Fig. 8.25. ATLAS (left) and CMS (right) simulations of WBF H — rtr” events after 
30 fb—! of data at LHC. The Higgs resonance clearly stands out from the background. 
Figures from Ref. [55]. 
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Fig. 8.26. Top: schematic azimuthal projection of WBF H — tt77~ events at LHC. 
Bottom: diagram illustrating the lo uncertainty region (due to jet mismeasurement) of 
missing pr, and how a Z mass or Higgs mass hypothesis can be best fit using a xy? test. 
Figures from Ref. [50]. 
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the data required to discover a light SM Higgs boson using this channel. 
Keep it in mind when we see the current official discovery expectations in 
Sec. 8.2.3.7. Further improvements might also be expected from neural- 
net type analyses, which are coming to the fore now that Tevatron has 
demonstrated their viability. 

A final word on systematic uncertainties. Unlike the tortuous case of 
ttH, H — bb, we don’t have to worry about shape systematics here. The 
dominant background is Zjj production. We can separately examine Z — 
ee, uu, which produces an extremely sharp, clean peak, precisely calibrating 
Zjj production in Monte Carlo. The only uncertainty then is tau decay 
modeling, which is very well understood from the LEP era. 


8.2.3.5. Weak boson fusion H — WTW- 


A natural question to ask is, how well does WBF Higgs hunting work for 
My = 140 GeV, where H — WtW- dominates? We should expect fairly 
well, since it’s the production process characteristics that supply most of the 
background suppression, leaving us only to look for separated reconstructed 
mass peaks. 

For H — WtW- we'll consider only the dilepton channel, as it has 
relatively low backgrounds, while QCD gives a large rate for the other 
possible channel, one central lepton plus two central jets (and the minijet 
veto will likely not work). We’ll therefore rely on exactly the same angular 
correlations and transverse mass variable we encountered in the Tevatron 
case [37] (cf. Eqs. 8.8,8.9). The only critical distinction is then ey v. ee, 
uu samples, as the latter have a continuum background (Z*/y*). These 
are not too much of a concern, however. 

Without going too much into detail, Pl simply say that top quarks are 
a major background, and they have the largest uncertainty. The largest 
component comes from ttj production, where the extra hard parton is far 
forward and ID’d as one tagging jet; a b jet from top decay gives the other 
tagging jet, and the other b jet is unobserved. This background requires care 
to simulate, because the soft /collinear approximation in standard codes is 
no good. There is also a significant contribution from single-top production, 
and off-shell effects are crucial to simulate, which is not normally an issue 
for backgrounds at LHC [66]. Work is still needed in this area to be fully 
prepared for this particular search channel. Fortunately, we may expect an 
NLO calculation of ttj before LHC start [67]. 
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Fig. 8.27. ATLAS simulations of WBF H — WtW- events after 30 fb~! of data at 
LHC for My = 140 GeV (left) and 160 GeV (right). The Higgs signal clearly stands 
out from the background in both cases, although the Jacobian peak is easier to identify 
closer to threshold. Figures taken from Ref. [55]. 


Figure 8.27 shows the results of the same ATLAS/CMS joint WBF 
Higgs study for this channel [55]. The results are extremely positive, with 
S/B > 1/1 without a minijet veto over a large mass range; even for My = 
120 GeV, S/B ~ 1/2, allowing for Higgs observation even down to the LEP 
limit in this channel. The transverse mass variable works extremely well for 
Higgs masses near WW threshold, and reasonably well for lower masses, 
where the W bosons are off-shell. 


8.2.3.6. ttH,H — WtW- at higher mass 


A late entry to the Higgs game at LHC is top quark associated produc- 
tion, but with Higgs decaying to W bosons. Representative Feynman dia- 
grams are shown in Fig. 8.28. Obviously this is intended to apply to larger 
Higgs masses, but turns out to work fairly well even below W pair thresh- 
old [68, 69]. The key is to use same-sign dilepton and trilepton subsamples. 
The backgrounds then don’t come from pure QCD production, rather from 
mixed QCD-EW top quark pairs plus W, Z/7*, WTW—, etc. We would be 
especially eager to observe this channel because, if the HWW coupling is 
measured elsewhere, it provides the only viable direct measurement of the 
top quark Yukawa coupling. More on this in Chapter 8.3. 

A noteworthy features of this channel is that while the ttH cross section 
falls with increasing My, BR(H — W*W ) rises with increasing My in 
our mass region of interest, and the two trends coincidentally approximately 
balance each other. From a final-state rate perspective, this channel is 
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Fig. 8.29. Left: ttH cross section and BR(H —> W*+W) as a function of My. Right: 
the cross section to three different final states after top quark and Higgs decays. Figures 
from Ref. [69]. 


approximately constant over a wide mass range, up to about 200 GeV. 
Figure 8.29 shows this numerically. Figure 8.30 shows ATLAS’s expected 
statistical uncertainty on the top quark Yukawa coupling. It ranges from 
about 20% over a broad mass range for 30 fb~! of data, to about 10% from 
the full LHC run. Systematic uncertainties are currently unexplored. 


8.2.3.7. LHC Higgs in a nutshell 


LHC Higgs phenomenology has come a long way in the decade since the first 
comprehensive studies were reported (e.g. the ATLAS TDR [42]). The old 
studies give a seriously misleading picture of LHC capabilities. Students 
should refer to newer ATLAS Notes and the new CMS TDR [48]. Solid 


474 D. Rainwater 


œ 100 > 60 
Š L=30 fo"' = L=300 fb" 
g$ şs 50 4 m 
< 804 Ss o 2L final state 
ig O 2L final state 
© 3L final state 
© 31 final state 40 5 
60 + A Combined 
A Combined 
30 + 
40 + 
20 + 
20 4 10 4 
0 T T 0 T T 
150 200 150 200 
my (GeV/c?) M, (GeV/c’) 


Fig. 8.30. ATLAS prediction [69] for the top quark Yukawa coupling measurement 
uncertainty (statistical only) from ttH, H — WtW- , for separate leptonic final-state 
channels and combined. 


grounds exist for expecting even more improvements. Fig. 8.31 summarizes 
ATLAS’s projections for multiple Higgs channels as a function of Higgs 
mass. Note especially the new dominance of WBF channels and degradation 
of ttH. 


8.3. Is it the Standard Model Higgs? 


Imagine yourself in 2010 (hey, we’re optimists!), squished shoulder-to- 
shoulder in the CERN auditorium, waiting for the speaker to get to the 
punchline. Rumors have been circulating for months about excess events 
showing up in some light Higgs channels, but not all that would be ex- 
pected. LHC has 40 fb~!, after all. Your experimental friends tell you that 
both collaborations have been scrambling madly, independent groups cross- 
checking the original first analyses. Then the null result slides start passing 
by. No diphoton peaks anywhere. Nothing in the WW or ZZ channels. 
Even CMS’s invisible Higgs search (WBF - tagging jets with no central 
objects at all) doesn’t show anything. Numerous standard MSSM Higgs 
results fly by, invariant mass spectra fitting the SM predictions perfectly. 
The audience becomes restless, irritated. People around you mutter that 
there must not be a Higgs after all. But you realize that the speaker skipped 
mention of the WBF H — rt7~ channel. Then suddenly it appears, and 
there’s a peak above the Z pole, centered around 125 GeV, broader than 
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Fig. 8.31. ATLAS significance projections in multiple Higgs channels w/ 30 fb~! of 
LHC data [55]. 


you’d expect but the speaker says something about resolution will improve 
with further refinement of the tau reconstruction algorithms. It’s also a 
too-small rate, less than half what’s expected. 

So what is this beast? The bump showed up in a Higgs search channel, 
but at that mass it should have shown up in several others as well. If it’s 
Standard Model, that is. At 125 GeV there should be H — Wt+W7 in 
WBF, and H — yy both inclusively and in WBF, although maybe they’re 
still marginal. Photons turned out to be hard at first, and QCD predictions 
weren’t quite on the mark. Quite a few people are on their cell phones 
already. You hear a dozen different exclamations, ranging from “We found 
the Higgs!” to “The Standard Model is dead!”. Quite obviously this is a 
new physics discovery, but what exactly is going on? 

By now you should get the point of this imaginary scenario: finding 
a new bump is merely the start of real physics. For numerous reasons 
you’ve heard at this summer school, some better than others, finding a SM 
Higgs really isn’t very likely. But as we’ll see in Chapter 8.4, SM Higgs 
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phenomenology is a superb base for beyond-the-SM (BSM) Higgs sectors. 
They’re variations on a theme in some sense, with the occasional special 
channel thrown in, like the invisible Higgs search alluded to above. Our job 
will be to figure out what any new resonance is. But how do we go about 
doing that in a systematic way that’s useful to theorists for constructing 
the New Standard Model? 

For starters, we want to know the complete set of quantum numbers for 
any Higgs candidate we find. Standard Model expectations will probably 
prejudice us as to what they are (roughly, at least) based on which search 
channel a bump shows up in. But for the scenario above, I can envision at 
least three very reasonable yet completely different models that would give 
that kind of a result in early LHC running. We should keep in mind that 
further data may reveal more resonances — not everything is easy to see 
against backgrounds, or is produced with enough rate to emerge with only 
1/10 of the planned LHC data. In some cases we would have to wait much 
longer, using data from the planned LHC luminosity upgrade (SLHC) [70]. 
New physics could also mean new quantum numbers that we don’t yet know 
about, so we should be prepared to expand our list of measurements needed 
to sort out the theory, and spend time now thinking about what kinds of 
observables are even possible at the LHC. Some measurements will almost 
certainly require the clean environment of a future high-energy electron- 
positron machine like an ILC [71, 72]. The most complete picture would 
emerge only after combining results [73], which could take than a decade. 
In the meantime we might get a good picture of the new physics, but not 
its details. 

Let’s prepare a preliminary list of quantum numbers we need to measure 
for a candidate Higgs resonance, which Ill generically call ¢. In brackets 
is the SM expectation. I'll order them in increasing level of difficulty. (See 
also the review article of Ref. [74].) 


- electric charge [neutral] 

- color charge [neutral] 

- mass [free parameter] 

- spin [0] 

- CP [even] 

- gauge coupling (gwwa#) [SU (2) with tensor structure g””] 

- Yukawa couplings [m/v] 

- spontaneous symmetry breaking potential (self-couplings) [fixed by 
the mass] 
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Of course, the first two of those, electric and color charge, are known imme- 
diately from the decay products. (A non-color-singlet scalar is a radically 
different beast than the SM Higgs and would have dramatically different 
couplings and signatures.) Mass is also almost immediate, with some level 
of uncertainty that depends almost purely on detector effects. Spin and CP 
are related to some degree, and not entirely straightforward if the Higgs sec- 
tor is non-minimal and contains CP violation. Gauge and Yukawa couplings 
are generally regarded as the most crucial observables, and in some sense I 
would agree. However, I would argue that the linchpin of spontaneous sym- 
metry breaking (SSB) is the existence of a Higgs potential, which requires 
Higgs self-couplings. Measuring these and finding they match to some gauge 
theory with a SSB Higgs sector would to me be the most definitive proof 
of SSB, and strongly suggest that the Higgs is a fundamental scalar, not 
composite. It is also the most difficult task — perhaps not even possible. 

A cautionary note: the results I show in this section are in general 
applicable only to the Standard Model Higgs! This point is often lost in 
many presentations highlighting the capabilities of various experiments, but 
it is very easy to understand. For example, if for some reason the Higgs 
sector has suppressed couplings to colored fermions, then any measurement 
of, say, the b Yukawa coupling, will be less precise, simply because the signal 
rate is lower, yet the background remains fixed. It’s statistics! 


8.3.1. Mass measurement 


As already noted, our Higgs hunt pretty much gets us this quantum number 
immediately, but with some slop driven by detector performance. We want 
to measure it as accurately as possible, but in practice a GeV or so is 
good enough, because theoretical uncertainties in parameter fits tend to 
dominate for most BSM physics. (This is a long-standing problem in SUSY 
scenarios, for example. It may be that we need to know the Higgs mass 
theoretical prediction to four loops [75]; at present only a partial three- 
loop calculation is known [76], and only two-loop results exist in usable 
code [77].) Figure 8.32 shows the CMS and ILC expected Higgs mass 
precision as a function of My [78]. It varies, of course, because different 
decay modes are accessible at different My, and detector resolution depends 
on the final state. In general, photon pairs (H — yy) and four leptons 
coming from Z pairs (H > ZZ — 479+) will give the most precise 
measurement. As a rule of thumb, we may expect per-mille precision over 
a broad mass range, translating typically to a few hundred MeV. 
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Fig. 8.32. Expected SM Higgs mass precision at LHC (for CMS; ATLAS will be slightly 
different but comparable) and a future ILC, as a function of Higgs mass [78]. 


8.3.2. Spin & CP measurement 


Spin and CP (JPC) experimental measurements are linked, because both 
require angular distributions to obtain. Numerous techniques have been 
proposed to address this, with significant overlap but also some unique 
features with each method. Pll highlight the leading proposals which garner 
the most attention from LHC experimentalists today. 

From the observed final state we can tell that the Higgs candidate is a 
boson. We’ll start by assuming that it may be spin 0, 1 or 2, but no higher’. 
Then we recall that the Yang-Landau Theorem [80] forbids a coupling be- 
tween three S = 1 bosons if two of them are identical. Thus, if we observe 
o@— yy, then our new object cannot be spin-1, and C = 1. For the very 
curious student who wants to delve deeper, there is a recent report on CP 
Higgs studies at colliders [81]. 


İS > 3 fundamental particles are believed to have deep problems in renormalizable field 
theory [79]. 
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8.3.2.1. Nelson technique 


The first method is the oldest, developed by Nelson [82]. It assumes the 
object is a scalar or pseudoscalar and relies on the decay angular distri- 
butions to a pair of EW gauge bosons, which decay further. The most 
practical aspect relevant for LHC Higgs physics is in essence a measure- 
ment of the relative azimuthal angle between the decay planes of two Z 
bosons in turn coming from the scalar decay, in the scalar particle’s rest 
frame. See Fig. 8.33 for clarity. One bins the data in this distribution and 
fits to the equation: 


F(¢) = 14+ acos(¢) + 8 cos(2¢) (8.12) 


For a scalar, such as the SM Higgs, the coefficients a and ĝ are functions 
1 


of the scalar mass, and further we have the constraint that a(Mg) > 3- 
In contrast, for a pseudoscalar, a = 0 and @ = —0.25, independent of the 
mass. 

Reference [83] was the first to apply this to the LHC Higgs physics 
program using detector simulation. Assuming 100 fb~ of data, the study 
found that LHC could readily distinguish a SM Higgs from a pseudoscalar 
for My > 200 GeV, and from a spin-1 boson of either CP state from a 
little above that, but not right at 200 GeV; see Fig. 8.34. Applying this 
technique to My < 200 but above ZZ threshold was not examined. 

As a practical matter, H > ZZ“) observation is assured only for both 
Z bosons decaying to leptons (e or u), where there is essentially zero back- 
ground. Unfortunately, this is an extremely tiny branching ratio, only 
0.05% of all H — ZZ events. Some studies consider jj channels, 
which is a ten-times larger sample, in an attempt to increase statistics, but 
this suffers from non-trivial QCD backgrounds. 


8.3.2.2. CMMZ technique 


Reference [84] provides an extension to the Nelson technique below ZZ 
threshold. Its full analysis is far more in-depth, discussing the angular 
behavior of the matrix elements for arbitrary boson spin and parity. It 
first demonstrates how objects of odd normality (spin times parity) can be 
discriminated via angular distributions, but for even normality require a 
further discriminant. That is, a J? = 2+ boson could mimic a SM Higgs 
in angular distribution below ZZ threshold. (Exotic higher spin states can 


KA pseudoscalar doesn’t couple at tree-level to W or Z, but can have a (large) loop- 
induced coupling. 
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Fig. 8.33. Schematic of the azimuthal angle between the decay planes of Z bosons 
arising from massive scalar decay. All angles are in the scalar rest frame. Figure from 
Ref. [83]. 
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Fig. 8.34. Results of the LHC expectations spin/CP study of Ref. [83], showing how a 
SM Higgs could be distinguished from a pseudoscalar or spin-1 boson as a function of 
My. 


be trivially ruled out via the lack of angular correlation between the beam 
and the object’s flight direction.) 

The key discriminant is the differential partial decay rate for the off- 
shell Z boson! It depends on the invariant mass of the final-state lepton 


'Typically only one Z boson is off-shell for My < 2Mz, but this ceases to be a good 
approximation at much lower (but observable) masses. 
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pair and is linear in Z* velocity: 


T ~ B ~ y(My — Mz)? — M2 (8.13) 
Figure 8.35 shows the predicted distributions for 150 GeV spin-0,1,2 even- 
normality objects as a function of M,, the off-shellness of the Z*@*@~. The 
histogram represents about 200 events that a SM Higgs would give in this 
channel after 300 fb~! of data at LHC. Unfortunately there are no error 
bars, although one can estimate the statistical uncertainty for each bin as 
VN and observe that the measurement is likely not spectacular. We can 
expect that CMS and ATLAS will eventually get around to quantifying 
the discriminating power, but it would not be surprising to learn that this 
measurement requires far more data, e.g. at the upgraded SLHC [70]. 
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Fig. 8.35. Differential decay rate as a function of dilepton invariant mass of the off-shell 
Z* in ZZ events, for a 150 GeV SM Higgs v. spin-1 and spin-2 objects of even normality 
and the same mass. The histogram is the SM Higgs case for 300 fb~! of data at LHC. 
Figure from Ref. [84]. 
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8.3.2.3. CP and gauge vertex structure via WBF 


A third technique [85] takes a different approach, but addressing spin and 
CP in a slightly different way. Rather than examine Higgs decays, it notes 
that WBF Higgs production is observable for any Higgs mass, regardless 
of decay mode. Furthermore, the same HVV vertex appears on the pro- 
duction side for all masses, also independent of decay. More precisely, this 
vertex has the structure g”HV,,V, (V = W, Z). This tensor structure is 
not gauge invariant by itself. It must come from a gauge-invariant kinetic 
term (D,,®)'(D#®). Identifying it in experiment would go a long way to 
establishing that the scalar field is a remnant of spontaneous symmetry 
breaking. 

For a scalar field which couples via higher-dimensional operators to 
two gauge bosons, however, we may write down the CP-even and CP-odd 
gauge-invariant D6 operators [86]: 

2 2 


I (teywiw-w + 7 (eteywi wou (8.14) 


fo = Phas ake 


where Ag is the scale of new physics that is integrated out, W#” is the 
W boson field strength tensor, and W = Capu WP is its dual. After 
expanding ® with a vev and radial excitation, we obtain two D5 operators: 


1 1 
= + yey 
b= HWW A + 


HW}, W" 1 
i WaW (8.15) 


where A; are dimensionful but now parameterize both the D6 coefficients 
and the ® vev. 

These two D5 operators produce very distinctive matrix element behav- 
ior. Recalling that the external gauge bosons in WBF are actually virtual 
and connect to external fermion currents, the initial-state scattered quarks, 
we derive the following approximate relations for the CP-even operator, 
using Jı,2 for the incoming fermion currents: 

M $ jey : _ yee 0 70 _ 73 73] pil , >32 

e5 X Lee Juv (N * 92) — Gv 92, x [Ji J2 — Jp I] Pe Pp 

e,5 e,5 

(8.16) 
That is, the amplitude is proportional to the tagging jets’ transverse mo- 
mentum dot product. This is easy to measure experimentally — we just plot 
the azimuthal angular distribution, i.e. angular separation in the plane per- 
pendicular to the beam. It will be minimal, nearly zero, for ¢;; = 7/4. In 
contrast, the g” tensor structure of the SM Higgs mechanism does not 
correlate the tagging jets. The CP-odd D5 operator is different and more 
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Fig. 8.36. Azimuthal angular distributions of the tagging jets in WBF production of a 
SM Higgs v. scalar field coupled to weak bosons via CP-even/odd D6 operators. The 
dotted line in the left panel is the SM background, which is added to the signal curves. 
Figures from Ref. [85]. 


complex, but may be understood by noting that it contains a Levi-Civita 
tensor €””°> connecting the external fermion momenta. This is non-zero 
only when the four external momenta are independent, i.e. not coplanar. 
Thus this distribution will be zero for ¢;; = 0,7. 

Figure 8.36 shows the results of a parton-level simulation for scalars 
in both the mass range where decays to taus would be used, and where 
¢ — WtW- dominates. The SM signal curve is not entirely flat due to 
kinematic cuts imposed on the final state to ID all objects. The D5 oper- 
ators produce behavior qualitatively distinct from spontaneous symmetry 
breaking, with minima for the distributions exactly where expected, and 
orthogonal from each other. It would be essentially trivial to distinguish 
the cases from each other shortly after discovery, regardless of My and the 
particular channel used to discover the Higgs candidate. A key requirement 
for this, of course, is that the discovery searches don’t use this distribution 
to separate signal from background. 

Now, what happens if the Higgs indeed arises from SSB, but new physics 
generates sizable D6 operators? Since Hsm is CP-even, a CP-even D5 op- 
erator would interfere with the SM amplitude, while a CP-odd contribution 
would remain independent. This is illustrated in the left panel of Fig. 8.37. 
The obvious thing to do is create an asymmetry observable sensitive to this 
interference: 

_ 9(Adjy < 1/2) — o( Adis > 1/2) 
?  o(Ads5 < 7/2) + o(AG;5 > 7/2) 


With only 100 fb~1 of data at LHC (one experiment), this asymmetry 
would have access to Ag ~ 1 TeV, which is itself within the reach of LHC, 


(8.17) 
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Fig. 8.37. Left: As in Fig. 8.36, but with interference between the SM Higgs and a 
CP-even D5 operator. Right: the effective reach in As5,e for 100 fb™t at LHC, using only 
the rate information (top) or the asymmetry (bottom). Figures from Ref. [85]. 


likely resulting in new physics observation directly. One caveat: the study 
Ref. [85] was done before the gg — Hgg contamination [60] was known, 
which will complicate this measurement. 


8.3.2.4. Spin and CP at an ILC 


The much cleaner, low-background environment of ete collisions would 
be an excellent environment to study a new resonance’s spin and CP prop- 
JPC can in fact be determined completely model-independently. 
Recalling the LEP search, the canonical production mechanism is ete~ — 
ZH. We would identify the Z via its decay to leptons, and sum over all 
Higgs decays (this is possible using the recoil mass technique, coming up 
in Sec. 8.3.4). J and P are completely determined by a combination of the 
cross section rise at threshold and the polar angle of the Z flight direction 
in the lab, shown in the left panel of Fig. 8.38. The differential cross section 
is [71]: 


erties. 


do 


— x p[1 + af? sin? 0z + bnbcos0z +n?’ 6?’ (1+ cos? 0z)] (8.18) 
dcos Oz 


where a and b depend on the EW couplings and Z boson mass, 17 is a gen- 
eral pseudoscalar (loop-induced) coupling and 8 is the velocity. Far more 
sophisticated analyses techniques exist, often called “optimal observable” 
analyses [87], but are only for the terminally curious. 
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Fig. 8.38. Left: Feynman diagram for ete~ — ZH and schematic [88] showing the 


analyzing angles. Right: curves showing the threshold rate dependence for J = 0,1,2 
states in this channel [71]. 


If one would have the liberty to perform a threshold scan of Zġ produc- 
tion at an ILC, distinguishing given-normality J = 0, 1, 2 states is straight- 
forward due to their different G-dependence. For J = 0 it is linear, but for 
higher spin is higher-power in 8 [88]. The qualitative behavior is shown in 
the right panel of Fig. 8.38, complete with error bars for the SM Higgs case. 
However, while the physics is solid, experiments in the past have generally 
proved to be a horse race for highest energy, so there is no guarantee that 
one would have threshold scan data available. The angular distribution 
fortunately works at all energies. 


8.3.3. Higgs couplings at LHC 


Now to something much harder. It’s commonly believed that LHC cannot 
measure Higgs couplings, only ratios of BRs [42]. This is incorrect, but 
requires a little explanation to understand why people previously believed 
in a limitation. 

First, let me state that the LHC doesn’t measure couplings or any other 
quantum number directly. It measures rates. (This is true for any particle 
physics experiment.) From those we extract various g; -BR, by removing 
detector, soft QCD and phase space effects, among other things, using 
Monte Carlo simulations based on known physics inputs. 
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Fig. 8.39. Left: Standard Model Higgs total width as a function of My. Right: ex- 
pected experimental precision on Ty at ATLAS using the gg > H — ZZ — 4€ chan- 
nel [42] (CMS similar). 


Second, we note that for a light Higgs, which has a very small width 
(cf. Sec. 8.2.1.1), the Higgs production cross section is proportional to the 
partial width for Higgs decay to the initial state (the Narrow Width Approx- 
imation, NWA). That is, ¢gg..4 X T Hgg. Similarly, owar «Ty w+w-- 
The student who has never seen this may easily derive it by recalling the 
definition of cross section and partial decay width — they share the same 
matrix elements and differ only by phase space factors™. Typically we ab- 
breviate these partial widths with a subscript identifying the final state 
particle, thus we have I4, 'y, Te, etc. Since a BR is just the partial decay 
width over the total width, we then write: 


(on - BR), « (=) (8.19) 


where I’, and Ig are the “production” and decay widths, respectively. 
Third, count up the number of observables we have and measurements 
we can make. Assuming we have a decay channel for each possible Higgs 
decay (which we don’t), we’re still one short: Ty, the total width. Now, if 
the width is large enough, larger than detector resolution, we can measure 
it directly. Figure 8.39 shows that this can happen only for My = 230 GeV 
or so [42], far above where EW precision data suggests we’ll find the (SM) 


Higgs. Below this mass range, we have to think of something else. 


™Well, slightly more than that in the case of WBF, but the argument holds after careful 
consideration. 
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In the SM, we know precisely what I y is: the sum of all the partial 
widths. For the moment let’s assume we have access to all possible decays 
or partial widths via production, ignore the super-rare decay modes to first- 
and second-generation fermions. This is a mild assumption, because if for 
some reason the muon or electron Yukawa were anywhere close to that of 
taus, where it might contribute to the total width, it would immediately be 
observable. The list of possible measurements we can form from accessible 
(0 - BR)i ezp is: 


Ky Kr AW: XZ Yas Yw, Yz, lb, Zq; Zw (8.20) 


where X; correspond to WBF channels, Y; are inclusive Higgs production, 
and Z; are top quark associated production” We could easily add mea- 
surements like X,,, Ye, etc. if we wanted, because measuring zero for any 
observable is still a measurement — it simply places a constraint on that 
combination of partial widths or couplings. 

In the original implementation of this idea [89], the authors noted that 
the ttH, H — bb channel won’t work, so there is no access at LHC to I. 
However, there is access to r+. In the SM, the b and r Yukawa couplings are 
related by ra = I/I; = 3cqcpm?/m?, where cacp contains QCD higher- 
order corrections and phase space effects. [yw and Iz are furthermore 
related by SU(2)z, although we don’t need to use it. Now write down the 
derived quantity 


Tw = X-(l+r)+Xwt+Xz+X,+X, = (on) = (1-6)I'w 

(8.21) 
where X; is constructed from Xw, X4, Yw and Y}. Although T, is an 
infinitesimal contribution to T y, it is important as above, and it contains 
both the top quark Yukawa and W gauge-Higgs couplings. Our error is 
contained in € and is typically small. This provides a good lower bound on 
Tw from data. The total width is then 


Ty 
Ty = — (8.22) 
Xw 

and the error goes as (1 — €)~?. Assuming systematic uncertainties of 5% 
on WBF and 20% on inclusive production, this would achieve about a 10% 
measurement of [yw and 10 — 20% on the total width for My < 200 GeV. 
"For this case, we actually use the Yukawa coupling squared (y2) instead of r+, because 
decays to top quarks is kinematically forbidden. But this is irrelevant for our argument. 
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Voilà! We have circumvented the naïve problem of not enough inde- 
pendent measurements. The astute observer should immediately protest, 
however, and rightly so. The result is achieved with a little too much con- 
fidence that the SM is correct. Not only does the trick rely on a very 
strong assumption about the b Yukawa coupling, but there could be funny 
business in the up-quark sector, giving a large partial width to e.g. charm 
quarks, which would not be observable either via production (too little 
initial-state charm, and anyhow unidentifiable) or decay (charm can’t be 
efficiently tagged). Nevertheless, this was a useful exercise, because a much 
more rigorous, model-independent method is closely based on it. 

The more sophisticated method is a powerful least-likelihood fit to data 
using a more accurate relation than Eq. 8.19 between data and theory [90]: 


on" pla 
on: BR(H > zz) = TSM ` É; (8.23) 
P 


where the partial widths in the box are the true values to be extracted from 
data, and the (a /T)sm ratio in front quantifies all effects shoved into Monte 
Carlo using SM values: phase space, QCD corrections, detector, etc. As 
before, the “sum” of all channels provides a solid lower bound on T p, simply 
because some rate in each of a number of channels requires some minimum 
coupling. But these are found by a fit, rather than theory assumptions. 
It also properly takes into account all theory and experimental systematic 
and statistical uncertainties assigned to each channel. We then need only 
a firm upper bound on Ty and the fit then extracts absolute couplings 
(transformed from the partial widths). This bound comes from unitarity: 
the gauge-Higgs coupling can be depressed via mixing in any multi-doublet 
model, as well as any number of additional singlets, but it cannot exceed 
the SM value, which is strictly defined by unitarity. Thus Ty < (ee 
(This bound is invalid in triplet models, but these have other characteristics 
which should make themselves apparent in experiment.) The WBF H — 
W*W~ channel then provides an upper limit on Ty via its measurement 

The method can be further armored against BSM alterations by in- 
cluding the invisible Higgs channel, allowing additional loop contributions, 
and so on. Of course, the more possible deviations one allows, the larger 
the fit uncertainties become. We see this in the differences between the 
left and right panels of Fig. 8.40 [90]. It is obvious that LHC’s weakness 
is lack of access to H — bb. Nevertheless, LHC can measure absolute 
Higgs couplings with useful constraints on BSM physics. This is especially 
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Fig. 8.40. Left: a least likelihood general fit on simulated LHC data, with no additional 
assumptions about the Higgs sector. Right: the fit assuming no new particles appear in 
Higgs loop-induced decays, and the gauge-Higgs coupling fixed exactly to the SM value. 
Figures from Ref. [90]. 


true for My 2 150 GeV, where LHC can achieve O(10%) precision on the 
gauge-Higgs couplings and the total width. 

The fit as implemented in Ref. [90] fixes My. This is a slight cheat, since 
for some My the BRs change quite rapidly, and a 1-2 GeV uncertainty can 
lead to a lot of slop in the coupling extraction. This is especially critical for 
the Higgs sector of the Minimal Supersymmetric Standard Model (MSSM). 
Eventually a fit to My will also have to be included, which will degrade 
measurement precision somewhat. 

At the same time, there is cause for optimism. The results of Fig. 8.40 
were based on very conservative, almost pessimistic assumptions: overly- 
large systematic errors, WBF not being possible at all at high-luminosity 
running, no minijet veto for WBF (cf. Sec. 8.2.3.3), and lack of progress 
in higher-order QCD calculations for signals and backgrounds. The reality 
is that significant progress has been made regarding QCD corrections, and 
we'll see one example shortly. Also, everyone knows that the minijet veto is 
a qualitatively correct aspect of the physics, we just can’t accurately predict 
its impact. Early LHC data from Zjj production should take care of this. 
Furthermore, ATLAS and CMS experimentalists fully expect WBF to work 
at high-luminosity LHC running, they just don’t have full simulation results 
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for the probable efficiencies. Also, we may expect far better performance 
in the WBF H — 7t7~ channels as discussed in Sec. 8.2.3.4. Finally, if 
new physics exists up to a few TeV, it will be observable and we can take 
it into accounts in Higgs loop-induced decays. 

Now to QCD corrections. Ref. [90] used large QCD uncertainties for 
Ogg>H and T4, 20% each, which is the correct NNLO uncertainty for each 
by itself. However, these two quantities appear as a ratio in our observables 
formula, Eq. 8.23. As pointed out in Ref. [91], most of these uncertainties 
drop out in the ratio. The reason for this is that the QCD corrections to 
the cross section and partial width are largely the same: 


T ~ aS (ur)C? (uR)[1 + as(ur)Xi +- (8.24) 
o ~ asl(ur)CT(uR)[L + as(ur)%1 + --.] (8.25) 


The correct uncertainty on the ratio is 5%, which will have an enormous 
impact on the fits of Fig. 8.40. We eagerly await new results from this and 
other improvements! 


8.3.4. Higgs couplings at an ILC 


Measuring Higgs couplings at an e*e~ collider would be far more straight- 
forward and rely on far fewer theoretical assumptions. Between that and 
being a colorless collision environment, it would also involve far fewer sys- 
tematic uncertainties. I'll outline the basic idea. 

In fixed-beam collisions it’s possible to measure the total ZH production 
rate. To see this, we just apply a little relativistic kinematics, rewriting the 
invariant Mĉ,: 


M?, =p} = (p+ +p- - pz) = s + Mż — 2Ez Vs (8.26) 


We see that observing the Higgs and measuring its total rate boils down 
to observing Z bosons via their extremely sharp dimuon peak and plotting 
this recoil mass. Figure 8.41 shows what the resulting event rate looks like 
in this distribution. The Higgs peak is clearly visible and sidebands allow 
one to subtract the SM background in the signal region. This captures all 
possible Higgs decays, even though that aren’t taggable or even identifiable, 
simply by ignoring everything in the event except for the Z dimuons. 

Simulations [71] suggest that the recoil mass technique would allow for 
about a 2.5% absolute measurement of the ZH rate. Since the cross section 
depends on the Z—Higgs coupling squared, the coupling uncertainty is then 
about a percent. 
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Event rate of the recoil mass for et+e~ — utu +X at a future high-energy 


linear collider. ZH production will fall into this sample, but the Higgs decays are ignored, 
thus capturing the total Higgs production rate. (Figure modified from Ref. [92] for a 


public talk by one of the a 


uthors. ) 


Getting from this one coupling and the total rate to any other coupling 


is formulaic: 


1. In the total rate, measure the best branching ratios, whatever they 
may be. Depending on the mass and detector performance, that’s 
likely one of bb, yy or WTW- decays. 

2. Now look in WBF Higgs production® with the Higgs decaying to 
the same best final state. This yields the partial width Dw. 

3. Calculate the total Higgs width as lw/BR(H —> WtW). 

4. Any other measured BR now gives that individual partial width, 
therefore the relevant coupling (or couplings for some loop-induced 


decays). 


Table 8.2 enumerates the results of ILC simulation for select My [93]. 
(Clearly more thorough work should be done here.) There are a few note- 
worthy features. First, H — bb would be accessible even as a rare BR at 


°For a linear collider this is both e+ e— 


— ete—H and ete 


— vVH, since e and v are 


distinguishable. Experimentally they become two different analyses. 
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Table 8.2. Estimated precision on various SM Higgs partial 
widths for a few select values of My, from measurements at 
a future e+e~ collider [93]. 


My (GeV) 120 140 160 180 200 220 
Decay Relative precision on T; (%) 
bb 1.9 2.6 6.5 12.0 | 17.0 | 28.0 
ee 8.1 19.0 
Ttr 5.0 8.0 
gg 4.8 14.0 
wtw- 3.6 | 2.5 | 21 
ZZ 16.9 
yy 23.0 
Zy 27.0 


larger My, due to the nearly QCD-free collision environment. Second, a 
weak measurement of H — cé should be possible, for the same reason, and 
due to the superior b v. c resolution of the next generation of collider de- 
tectors. Third, H — jj is also accessible. This would be attributed to gg, 
which is a mild theoretical assumption. It is in principle sanity-checkable by 
the absence of an anomalous high-x Higgs production rate at LHC, which 
would come from sea or valence quarks and a non-SM coupling to lighter 
fermions (which would be difficult to accommodate theoretically, so not 
expected). 

But what about the top Yukawa coupling? Its anticipated value of ap- 
proximately one is curious enough to warrant special attention. A light 
Higgs can’t decay to top quark pairs, so we’d have to rely on top quark 
associated production, as at LHC but without all the nasty QCD back- 
grounds. However, the event rate is far lower than at LHC and would 
require an 800 GeV machine collecting 1000 fb~! [94], the planned lifetime 
of a next-generation second-stage machine (justifying my previous state- 
ment about the drive to go to maximum energy and sit there). One study 
combined expected LHC and ILC results [73, 95], and there are more recent 
results for ILC, summarized in Fig. 8.42 [96]. SLHC and an ILC would be 
complementary, granting superb coverage of My for a y measurement at 
the 10% level. 

More sophisticated LC Higgs coupling analyses exist [87], but aren’t of- 
ten reviewed. They use a more complicated “optimal observables” (detailed 
kinematic shape information, for example) scheme. It’s more powerful, but 
doesn’t lend itself to the simplistic formulaic approach I just discussed. 

I should emphasize that the results I reviewed are relevant only for the 
Standard Model. If the Higgs sector is non-minimal, or any new physics 
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Fig. 8.42. Top Yukawa coupling measurement expectations for a future 800 GeV ete~ 
collider [96]. 


appears at the weak scale, it could result in altered couplings (and usu- 
ally does; see Chapter 8.4). If they’re suppressed, the event rate goes 
down, resulting in greater uncertainty. This is often glossed over or ig- 
nored in discussions of Higgs phenomenology, but is a potential reality and 
something we’d just have to lump. Nevertheless, it should be clear by 
now that an ILC would be a spectacular experiment for precision Higgs 
measurements. 


8.3.5. Higgs potential 


Finally we arrive at the most difficult Higgs property to test, the potential. 
This is the hallmark of spontaneous symmetry breaking, thus ranks at least 
as high in priority as finding Yukawa couplings proportional to fermion 
masses. To see what’s involved, let’s review the SM Higgs potential. The 
potential is normally written as: 


V(®) = pSt + (O16)? (8.27) 


where ® is our SU(2); complex doublet of scalar fields. The Higgs spon- 
taneous symmetry-breaking mechanism is what happens to the Lagrangian 
when u? < 0 and the field’s global minimum shifts to v = \/—p?/A. We 
then expand ® — v + H(z) (ignoring the Goldstone modes which you 
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learned about in Sally Dawson’s lectures) where H (æ) is the radial excita- 
tion, the physical Higgs boson. The Higgs mass squared is then 2v?A, and 
is the only free parameter, although constrained (weakly) by EW precision 
fits. The student performing this expansion will also notice HHH and 
HHHH Lagrangian terms, which are self-interactions of the Higgs boson. 
The three- and four-point couplings are —6vX and —6,, respectively”. 

To measure the potential is to measure these self-couplings and check 
their relation to the measured Higgs mass. Our phenomenological approach 
is to rewire the Higgs potential in terms of independent parameters and the 
Higgs candidate field nz: 


1 1~ 
Vina) = 5 Mi na + Aun, + 7A" (8.28) 


À and À are now free parameters, which we measure from the direct pro- 
duction rate of HH and HHH events. This will ultimately be a voyage of 
frustration. 


8.3.5.1. HH production at LHC 


We begin with Higgs pairs at LHC. The dominant production mechanism 
is gluon fusion, gg —> HH [97-99]. The Feynman diagrams are shown in 
Fig. 8.43. The first diagram is off-shell single Higgs production which split 
via the three-point self-coupling to a pair of on-shell Higgses, which then 
decay promptly. The second diagram is a box (four-point) loop contribu- 
tion which involves only the top quark Yukawa coupling. Interestingly, the 
two diagrams interfere destructively and have a rather large cancellation. 
This means the rate is small [100], as shown in Fig. 8.44, making our life 
difficult with a small statistical sample. On the other hand, the destructive 
interference will turn out to be crucial to making constructive statements 
about the self-coupling A. 

The left panel of Fig. 8.44 tells us that we can expect O(10k) light Higgs 
pair events per detector over the expected 300 fb~! lifetime of the first LHC 
run, and ten times that at SLHC. That sounds like a lot, but keep in mind 
that both Higgses have to decay to a final state we can observe, which will 
reduce the captured rate to something much smaller. Then we have to 
consider what backgrounds affect each candidate channel. 

The right panel of Fig. 8.44 shows selected Higgs pair branching ratios. 
At low mass, decays to b pairs dominate, as expected, while for My = 
135 GeV mass it’s W pairs. We can immediately discount the 4b final state 


PDon’t forget the identical-particle combinatorial factors. 
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Fig. 8.43. Feynman diagrams for the dominant Higgs pair production rate at LHC, 
gg > HH. 
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Fig. 8.44. Left: Higgs pair production cross sections at LHC as a function of My [100]. 
Arrows show the change of the cross section as A is increase, and the tips are at one- 
half and twice the SM value. Right: Higgs pair branching ratios as a function of My, 
calculated using HDECAY [22]. 


as hopeless, based on what we already learned about QCD backgrounds — 
but 4W is promising for higher masses. The next-largest mode from those 
two is bbW+W~, which unfortunately is the same final state as the far 
larger top quark pair cross section. A few minutes’ investigation causes 
this to be discarded, even after trying various invariant mass constraints; b 
pair mass resolution is just not good enough. The bbr+7r~ mode has very 
low backgrounds, comparable to the signal, but suffers hugely from lack of 
statistics, due to low efficiency for subsequent tau decays. However, the 
rare decay mode bbyy is extremely clean and worth further consideration 
at low masses. 


HH —> WW- W *W_ at LHC 


HH —W*W-W TW has myriad decays, but for triggering purposes 
and to get away from QCD background sources of leptons (like top quarks) 
we need to select special multilepton final states [101]. The most likely 


496 D. Rainwater 


0.8 > m~r eae 
| 7 *s l 
L N pp rd? +4j | 
t 
F N my=180 GeV 1 
— 06H i _ 
£ H 2 AA 4 
ae H A 4 
s5 L + : J 
f: ttw \ 
K= [ k \ q 
Be 04 N =| 
< b 4 4 
uo) = y 
T Fo HH, m>% \wtwtw jj J 
© j Sey \ | 
N 
p he Ka $ 7 
0.2 = ` ` -~ 
0.0 : 
1.0 1.5 2.0 2.5 3.0 
AR(jj) min 


Fig. 8.45. Differential cross section as a function of the minimum jet pair lego plot 
separation for ¿tT + 4j at events at LHC. The solid curve is the correct distribution 
using exact matrix elements for HH, while the dash-dotted curve comes from effective- 
Lagrangian matrix elements where the top quark mass is taken to infinity. Figure taken 
from Ref. [101]. 


accessible channels are same-sign lepton pairs, (+@+ + 4j, and three lep- 
tons, 4+7 L+ + 2j, since the principal QCD SM backgrounds can’t easily 
mimic them. Note that because of multiple neutrinos departing the de- 


tector unobserved, complete reconstruction is not possible. The principle 
backgrounds are WWW jj, ttW, ttj, ttZ/y* and WZ + 4j, but we also 
need to consider titt, 4W, WTW- + 4j, WtW~ Zjj as well as double 
parton scattering and overlapping events. The calculation of all of these 
is technical so I won’t go into it, rather simply mention a few noteworthy 
points. 

The first is a warning about using the gg — HH effective Lagrangian 
in practical calculations. It is still a mystery why the leading term in the 
V3/m, expansion [97] should get the overall rate so close that of an exact 
calculation [98], but it does. Because of that, nobody has ever bothered to 
calculate higher-order terms in the effective Lagrangian expansion; in any 
case, the exact results are available, as well as NLO in QCD [99]. However, 
the leading terms in the expansion cancel too much close to threshold, 
yielding incorrect kinematics [101], as can be seen from Fig. 8.45. One 
should thus use only the exact matrix element results for practical gg — 
HH phenomenology. 
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Fig. 8.46. Visible invariant mass distribution for same-sign dilepton plus four jet event 
at LHC [101]. All SM backgrounds are summed into one curve, while the gg > HH 
signal is shown separately, for the SM value of self-coupling A, twice that value, and zero. 


The second point is that our main systematic uncertainties will be our 
limited knowledge of the top quark Yukawa coupling, which drives the 
production rate, and the BR to W*W~, which drive the decay fraction. 
These must be known very precisely for any measurement to be useful. 

We will need a discriminating observable to separate signal from back- 
ground. We can speculate that nearly all the signal’s kinematic information 
is encoded in the invariant mass of the visible final state particles, so let’s 


construct a new variable, mys: 


2 2 
Mois = So Ei = Xp: 
i i 


where 7 are all the leptons and jets in the event. We suspect a difference 
because the signal is a two-body process, which is threshold-like, while the 
backgrounds are multi-body processes which peak at much larger Myis than 
the sum of their heavy resonances’ masses. 

Figure 8.46 displays the fruits of parameterizing our ignorance (or 
rather, the detector’s). The separation between signal and background 
is exactly as expected: the signal peaks much lower, allowing a x? fit to 
distinguish it from the backgrounds. But the plot also reveals a saving grace 


(8.29) 
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Fig. 8.47. 95% CL limits achievable at LHC on the shifted Higgs triple self-coupling 
(see text), AA, for LHC and SLHC expected luminosities [101]. 


in the destructive interference between triangle and box loop diagrams. If 
spontaneous symmetry breaking isn’t the right description and there is no 
Higgs potential, then À = 0 and the lack of destructive interference gives a 
wildly larger signal cross section, which is far easier to observe. 

Figure 8.47 summarizes the results of Ref. [101]. It plots 95% CL limits 
on the shifted self-coupling, AA = (A—Asm)/Agsm. This is somewhat easier 
to understand: zero is the SM, and -1 corresponds to no self-coupling, or 
no potential. For My > 150 GeV, the LHC can exclude À = 0 at (for some 
My much greater than) 20 with only the LHC. After SLHC running, this 
becomes a 20 — 30% measurement, if other systematics are under control. 
Here, they’re assumed to be smaller than the statistical uncertainty. 

Another potential systematics issue is minimum bias, the presence of 
extra jets in an event which don’t come from the primary hard scatter- 
ing. Here, they could be confused with jets from the W bosons, causing a 
distortion of m,;;. ATLAS has investigated this and found it to not be a 
concern — the shape of Muis for the signal remains largely unaltered [102]. 


HH — bbyy at LHC 


We’ve already ruled out as viable the vast majority of Higgs pair BRs 
for My < 150 GeV due to QCD backgrounds or too-small efficiencies. 


Prd 


Searching for the Higgs Boson 499 


Table 8.3. The major ID efficiencies and fake photon rejection 
factors at LHC. Note the two values for P;—.., which represent 
the current uncertainty in detector capability for fake photon 
rejection. The true value won’t be known until data is collected. 
See Ref. [103] for details. 


7 o 
Ey Eu Pop, Pj ES E a 


LHC | 80% 90% 1/13 1/140 1/1600 1/2500 
SLHC | 80% 90% 1/13 1/23 1/1600 1/2500 


However, the rare decay mode to bbyy is worth a closer look [103]. There 
are many backgrounds to consider, coming from b or c jets plus photons, 
or other jets which fake photons, just as in the single Higgs to photon 
pairs case. Table 8.3 highlights the major ID efficiencies and fake photon 
rejection factors at LHC and SLHC relevant for us. The backgrounds are all 
calculable at LO, but with significant uncertainties, probably a factor of two 
or more. However, that won’t be a concern as we can identify distributions 
useful for measuring the background in the non-signal region. Note that 
with this channel we can completely reconstruct both Higgs bosons. 

The background QCD uncertainties have a work-around. There are two 
angular distributions in the lego plot which look very different for the signal, 
principally because scalars decay isotropically and thus are uncorrelated, 
while the QCD backgrounds have spin correlations. The two distributions 
are shown in Fig. 8.48. The differences are rather dramatic (and even more 
so in 2-D distributions). Tevatron’s experiments CDF and DØ have used 
such a pseudo-sideband analysis for some time to measure a background in 
a non-signal region to normalize their Monte Carlo tools, then extrapolating 
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Fig. 8.48. Angular separations in the lego plot for b jets and photons in gg > HH — 


bbyy signal events and background at the LHC. Figures from Ref. [103]. 
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Table 8.4. Expected event rates after ID efficiencies and all kinematic cuts for bbyy 
events at LHC (SLHC), two detectors and 600(6000) fb~* of data [103]. LHC assumes 
only one b tag, while SLHC requires two. Note the increased fake rate at SLHC. 


[| EH | bby | cery [bbi | cong | ivy | bbij | ccd | 1459 | idii |X (oke) [| S/B | 
[tHe] 6] 2} rtitopPsfPTototrtiyt u [1/2 | 
[SLHC] 21] 6 | of 4}]ot;etitotustiy 2 [14] 


to the signal region to perform a background subtraction. The technique 
is viable because QCD radiative corrections in general do not significantly 
alter angular distributions. 

Table 8.4 summarizes the results of Ref. [103]. It gives event rates ex- 
pected with 600(6000) fb~ of data (two detectors) at LHC(SLHC). SLHC 
would not get ten times as many events because of lower efficiency of hav- 
ing to tag two b jets instead of only one, to overcome the low fake jet 
rejection rate in a high-luminosity environment. First, note that fake b 
jets or fake photons are the largest background: the measurement would 
be significantly hampered by detector limitations. Second, while the $/B 
ratio is excellent, the overall event rate is extremely small, definitely in the 
non-Gaussian statistics regime. 

SLHC could make a useful statement about A, ultimately achieving 
limits on AA of about 40.5, but this is not such a strong statement. It 
could at best generally confirm the SM picture of spontaneous symmetry 
breaking and perhaps rule out wildly different scenarios, but would never 
be particularly satisfying. On the other hand, it’s strong encouragement 
for ATLAS and CMS to push the envelope on tagging efficiency and fake 
rejection, especially for the detector upgrades necessary for SLHC. Doing 
studies like this well ahead of time is useful for this reason, our present case 
being a perfect example. 


8.3.5.2. HH production at an ILC 


While (S)LHC clearly has access to Higgs pair production and thus À for 
My > 150 GeV, it would disappoint at lower masses. We should see if a 
future linear collider could also give a precision measurement for A as it 
could for (most) other Higgs couplings. 

For ete™ collisions below about 1 TeV, double Higgsstrahlung is the 
largest source of Higgs pairs. The Feynman diagrams appear in Fig. 8.49, 
while the cross sections as a function of My for 500 and 800 GeV colli- 
sions [104] are found in Fig. 8.50, which also shows the cross sections times 
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Fig. 8.49. Feynman diagrams for double Higgsstrahlung at a future linear collider, 
ete- — HH. 
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Fig. 8.50. Left: the double Higgsstrahlung cross section as a function of My for 500 
and 800 GeV e*e~ collisions [104]. Right: the cross section times BR at 500 GeV and 
1 TeV ete” collisions, for the dominant final state BRs as a function of My [105]. 


BRs for the dominant final states over the range of Higgs masses. Roughly, 
this corresponds to 4b and 4W final state. The former is very steeply falling 
with My, but the latter is much flatter over the 100-200 GeV mass region, 
suggesting broader access if at all visible. 

The parton-level studies performed so far [105] are fairly encouraging. 
As shown in Fig. 8.51, an ILC could achieve about a 20— 30% measurement 
of À over a broad mass range, with somewhat worse performance around 
My ~ 140 GeV, where the bb and WHW- BRs are roughly equal. Inter- 
estingly, for a lower Higgs mass, the analysis prefers lower machine energy, 
while the opposite is true at least to a small degree at higher mass. This 
is largely a phase space effect for the 3-body production mechanism. Also, 
SLHC is superior for My = 150 GeV (largely due to better statistics), with 
an important caveat: controlling systematics in gg > HH — 4W at LHC 
would require precision input from ILC for the Higgs couplings and BRs. 
This is an excellent example of synergy between experiments. 
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Fig. 8.51. Estimated achievable limits in the shifted self-coupling AA (see Sec. 8.3.5.1) 
at future ete colliders of various energy, as a function of My [105]. 


Fig. 8.52. Representative Feynman diagrams for the WBF process ete~ — voHH. 


Double Higgsstrahlung is not the only source of Higgs pairs at an 
et 
production becomes more and more important. Representative Feynman 
diagrams for ete~ — viHH are shown in Fig. 8.52. A preliminary analy- 
sis [106] for CLIC [107], a second-generation 1 — 5 TeV ete” collider col- 
lecting 5000 fb~!, found rather interesting results, summarized graphically 
in Fig. 8.53. The principal finding is that no matter how high the collision 
energy goes, and regardless of Higgs mass, the precision on A bottoms out 
at 10 — 15%. This is because the self-coupling has an s-channel suppres- 
sion, and its contributions becomes washed out as by other diagrams as ys 


e collider, however. In fact, as the energy increases, WBF Higgs pair 
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Fig. 8.53. The results of Ref. [106] for WBF HH production at CLIC, a second- 
generation multi-TeV e+e~ collider. The plot labels are self-explanatory, while the 
colors are for various Higgs masses: 120 GeV in red, 140 GeV in blue, 180 GeV in green 
and 240 GeV in black. 


increases. A corollary, though, is that CLIC could potentially achieve bet- 
ter precision than SLHC for larger My, although this may be marginal. 
Much more detailed work would be required for both SLHC and CLIC, as 
well as experience at LHC and SLHC to determine its true potential, to 
make conclusive statements. 


8.3.5.3. Electroweak corrections to À 


One final word on the trilinear self-coupling A: Ref. [108] calculated the 
leading 1-loop top quark EW corrections to Asm. Their principal SM 
result is: 


AHHH = Qe? | 3x2 we ME (8.30) 
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The correction is —10%(—4%) for My = 120(180) GeV, non-trivial for 
smaller Higgs masses, but those are excluded in the SM. This correction 
should obviously be taken into account in any future analysis, should the 
Higgs be found. But it should be clear that neither (S)LHC nor ILC will 
be sensitive to it. Even CLIC would have only marginal sensitivity, and 
then only for low Mz. 

Non-minimal Higgs sectors and new physics effects can tell a very dif- 
ferent story, however, as we’ll see, coming up in Secs. 8.4.1 and 8.4.5. 


8.3.5.4. HHH production anywhere 


The trilinear self-coupling A is only part of our phenomenological Higgs 
potential of Eq. 8.28, though. We also need to measure X; the quartic self- 
coupling. In some sense this is equally important to measuring A. Recall the 
structure of the Higgs potential: A allows the global minimum to be away 
from zero, but a non-zero (and positive) À is required to keep the potential 
bounded from below. We can’t really convince ourselves that the potential 
structure of Eq. 8.27 is the right picture without a measurement of both 
these ingredients. We’ve just seen that probing A is extremely challenging. 
Just how difficult is this likely to be for À? 

For e*e~ collisions we already know this is hopeless: the HHH rate 
is both too low and its dependence on À too weak [104]. However, the 
situation at (S)LHC was only very recently investigated [109, 110]. The 
authors calculated the gg — HHH cross section, which involves Feynman 
diagrams like those of Fig. 8.54. Note the appearance of numerous diagrams 
dependent on the trilinear self-coupling, in addition to diagrams dependent 
only on ye. 

The results of the study are shown in Fig. 8.55, for a 200 TeV VLHC. 
They’re rather deflating because the cross section is miserably small. A 
challenge to the student: find a three-Higgs BR to a final state that could 
be observed at a VLHC, where the rate is not laughable. Good luck! In 
addition, the right panel shows that any variation of the trilinear coupling 
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Fig. 8.54. Representative Feynman diagrams for gg — HHH. 
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Fig. 8.55. Left: 200 TeV VLHC gg — HHH cross section as a function of My. Right: 
differential cross section as a function of My p 7 for three values each of A and A. Figures 
from Ref. [109]. 


completely swamps variation of the quartic À, whose own variation is 
already infinitesimal. 

In summary, it appears that we will likely never achieve a complete 
picture of the Higgs potential. This of course applies only to the Standard 
Model. Coming up in Chapter 8.4 we’re going to see that for BSM physics 
the situation is even more discouraging. 


8.4. Beyond-the-SM Higgs sectors 


Now that we know how the Standard Model Higgs sector works — how it 
could be discovered and measured at LHC - it’s natural to think about 
other possibilities for EWSB. The SM Higgs is elegant in its simplicity, but 
as you know from Sally Dawson’s SM lectures, it’s probably too minimal 
— nagging theoretical questions remain about Higgs mass stability, flavor 
(ignoring this is kind of a black eye), neutrino masses (another black eye), 
and so on. Because new physics that could explain dark matter is likely 
to also lie at the TeV scale, most model building makes an attempt to 
incorporate solutions to some of these other problems along with EWSB. 
The literature is vast, but let’s try to roughly classify some of the major 
ideas to get a handle on the variations. 

The broadest two categories of classes are weakly-coupled new physics 
which can be handled with perturbation theory, and strongly-coupled or 
“strong dynamics” models which are penetrable in some cases, others not. 
These include QCD-inspired theories like Technicolor [10, 11] (or more prop- 
erly Extended [12, 13] or Walking [14] Technicolor, which can handle a 
top quark mass very different from the other quark masses) and Topcolor- 
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assisted Technicolor [15] (“TC2”), which incorporates additional weakly- 
coupled gauge structure. Strong dynamics assumes that some TeV-scale 
massive or heavier fermions’ attraction became strong at low energy scales, 
eventually causing their condensation to mesonic states (Technipions, Tech- 
nirho, Technieta, etc.), the neutral scalars of which can incite EWSB via 
their SU (2); gauge interactions. Strong dynamics scenarios are beyond the 
scope of these lectures, however, so I leave it for the interested student to 
study the excellent review article of Ref. [111]. 

While strong dynamics theories are Higgsless in some sense, meaning 
no fundamental scalar fields, the terms is usually reserved for a new class 
of models where the EW symmetry is broken using boundary conditions on 
gauge boson wavefunctions propagating in finite extra dimensions (see e.g. 
Refs. [18, 112]). We’ll also skip these. 

There is far more theoretical effort expended on weakly-coupled EWSB, 
which is mostly variations on what we can add to the single Higgs doublet 
of the SM: 


© 1HDM + invisible (high-scale) new physics, hidden from direct 
detection 

CP-conserving 2HDM: 4 types (minimal supersymmetry, MSSM, 
is Type II) 

CP-violating 2HDM 

Higgs singlet(s) (e.g. next-to-minimal supersymmetry, NMSSM) 
Higgs triplets (often appear in Grand Unified Theories) 

Little Higgs models: SU(2)z x U(1)y is part of larger gauge and 
global group 


© 


0000 


The first item, new high-scale physics hidden from direct detection, sounds 
like a cheat. It actually involves an important aspect of phenomenology: 
effective Lagrangians from higher-dimensional operators. We’ll come back 
to these in a moment. Two Higgs doublets instead of one is an idea with 
multiple sources. For instance, one doublet could give mass to the leptons 
and the other to the quarks, or one to the up-type fermions and the other 
to the down-type, etc. We’ll return to these after effective operators. Ad- 
ditional Higgs singlets likewise have a variety of reasons for being written 
down, but usually it’s just “we can do it, so we will”. We’ll skip these. Higgs 
triplets originated from natural appearance in left-right symmetric GUTs. 
They’re a bit exotic and typically have issues with precision EW data, but 
are interesting in that they predict the existence of doubly-charged Higgs 
states H=, and a tree-level H+W*Z coupling, which must be zero in 
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most Higgs-doublet models. It would therefore stand out experimentally. 
For all these cases I don’t have time to cover, the Higgs Hunter’s Guide is 
the best place to start to learn more [6]. 

Little Higgs theories, on the other hand, are different in that they nec- 
essarily involve new scalar and gauge structure arising from an enlarged 
global symmetry from which the SM emerges, as well as additional mat- 
ter content. Interestingly, in these models the Higgs looks very much like 
the SM Higgs, but with O(v?/F?) corrections, where F is typically a few 
TeV, parametrically 47 larger than the EW scale. The smallness of v?/F? 
could make it very difficult to measure Little Higgs corrections to Higgs ob- 
servables. These models are probably ultimately strongly-coupled at a scale 
A ~ 4rF, but this is an open question. If nature chose this course, the most 
interesting physics is the new gauge boson and matter fields that appears 
at a scale F. Refs. [19] provide nice overviews and simple explanations of 
the two primary Little Higgs mechanisms. 


8.4.1. Higher-dimensional operators 


The new physics responsible for dark matter, flavor, neutrino masses, etc., 
might very well be too massive to produce directly at colliders. This the 
dreaded SM-Higgs-only scenario, where LHC sees nothing new. It would 
really be an invitation to take a more rigorous look at all data — new physics 
effects might still appear as small deviations in precision observables. 

The standard way of parameterizing this is to write down all the possible 
Lagrangian operators with the heavy fields integrated out which preserve 
SU(3).x SU(2), xU(1)y gauge invariance. This was done over two decades 
ago for operators up to dimension six [86]. Although not often emphasized 
in today’s phenomenology, I consider this paper a must-read for all students. 

Let’s begin by considering the possible operators involving only the SM 
Higgs doublet. There are two, of dimension six: 


1 1 


O1 = 5% (616) 0"(GTS6) & O= =. (616)3 (8.31) 
for the effective Lagrangian contribution 
2 fi 
Lep,o = qa i , fi >0 (8.32) 


A must be at least a couple TeV, otherwise we’d likely observe it directly 
at LHC. If you’ve somewhere seen an alternative effective theory for the 
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Higgs potential written as 


AÀ y2\ 2 

n 

Ver = > T (P z =) (8.33) 

the operators written above correspond to the n = 1 term in this expansion. 
O, modifies the Higgs kinetic term, while O2 modifies the EW vev, v: 


1 1 v? uv up fo ve 
eas) L SE ee H —w 2/7, 220 
Lyin = 5 ð p otp + 5 fi me OMOLUO 5 5 (1 ) (8.34) 


where v is what Gr measures. We must also canonically normalize the 
physical Higgs field: 6 = NH with N =1/(1+ fi). 

This results in a number of alterations to masses and couplings [113]. 
First, the Higgs mass itself receives corrections from the expected value, 
given 2: 


fo v 
M?, = 2)v7(1 (: ae me +5, =) (8.35) 


where the f2 term is independent of A. Next, Higgs gauge couplings receive 
v? /A? shifts: 


l 2 fr v? +w- + 
59 1- = |HW; Ww? eee HAW rw? 

. 2 (8.36) 
~—v|1-—-5 | HZ,Z” ->— |1- fiz | HHZ,Z” 
so bs) ? A 12) ” 
Finally, the Higgs boson self-couplings are (phases vary with Feynman rule 
convention): 


3m? fiv?  2fo v v? Qf, v? 
as Gao a 3M2 zor Pi) (837) 


i<j 


2 2 2 2 
Darl = ae (: -fy +r) + E a Spry] (838) 
i<j 
Note that ©ı and ©, both enter here, but more importantly there are 
momentum-dependent terms, which are typical of higher-dimensional op- 
erators. The effect of these terms would be anomalous high-pr Higgses in 
pair production. 
Only one phenomenological analysis exists for these effects, and only 
for precision experiments at a future ILC and CLIC [113]. In this study, 
measurements are expressed in terms of a; = fiv?/A?, since f; and A 
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Fig. 8.56. Achievable uncertainty on measurements of the a, (left) and a2 (right) co- 
efficients of Eq. 8.31 (also see text) at a future ILC for 500 and 800 GeV running [113]. 


can’t be easily separated from so few measurements. Higgsstrahlung, dou- 
ble Higgsstrahlung and WBF Higgs pair production together measure a 
combination of a, and ag. 

Figure 8.56 shows the expected achievable uncertainties (not limits!) 
on a; and az at a future ILC. For fı = 1, this corresponds to a reach in 
A of about 4 TeV, possibly out of the reach of LHC depending on what 
might be directly produced. For fə = 1, however, this corresponds to only 
about A ~ 0.8 TeV, easily accessible at LHC. Put another way, an ILC 
could have access to new high-scale physics via altered Higgs-gauge boson 
couplings, but not via Higgs self-couplings. This is in line with what we’d 
come to expect, as HH production is much smaller. The shapes of the 
uncertainty curves in the figures depend on what values of the operator 
coefficients add to or subtract from the signal, with the added feature that 
the momentum dependence of the Higgs self-couplings that the O1 operator 
introduces changes to kinematic distributions. 

In addition to the Higgs-only D6 operators, there are a handful of op- 
erators involving the Higgs and gauge boson fields together [86]: 


Oww = (¢'d) [WLW + 5W2 W3] 
Ogg = (¢'¢)Byy BY” 
Osw = BY [(gtotd) Wi, + VZT EW t + (TW ;a]] 
Og = (D"$)'(D’¢) Buy 
Ow = (D"¢)' [o (D’6)W3, + V2[T* (D’g)W,t, + T- (D)W; ]] 
Os ı = (D Atp (D"¢) 


These induce momentum-dependent HHVV vertices, so could be studied 
at an ILC or CLIC in the same manner as the Higgs-only couplings, as well 
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as with rare Higgs decays [114], but in general they’re highly constrained 
by EW precisions observables (S, p, gvvv) [115]. Interestingly, it appears 
there has not been an update of the EW constraints on these operators 
since 1997 [116], although there are predictions for limits at an ILC [117]. 
There is, however, a new analysis for WBF Higgs at LHC includes the 
effects of some of these operators and finds that they would be encoded in 
the tagging jet azimuthal separation [118]. 

There is also a set of D6 operators involving the Higgs, fermion and 


gauge boson fields [86]: 


Oag = (¢'¢) (qd) 
Oga = i(6' Dud) (dyd) 


Opa = (Dyqd)D"¢ 
OW = ilo D e) (aa) 


Opp = i(ġteD o) (uy"d) (8.39) 


Oaw = (Go"”o'd)oWi., 
Oga = UG" D,0'd)(G"o%a) 
Opa = (Dd) D" ¢0aB = (Go d) OB 


Some of these are constrained by precise LEP measurements of Zbb, ybb 
couplings, but not severely. They would give interesting rare Higgs decays 
like H — bbZ, bby. Their phenomenology for LHC and even ILC is not 
really studied. Thus, I can’t say to what scale they might be sensitive 
given a SM Higgs discovery with nothing else observed. 


8.4.2. Two-Higgs doublet models (2HDMs) 


The most-often studied extension to the SM Higgs sector is the two-Higgs 
doublet model (2HDM) [6, 119]. That is, we add one additional SU (2)z 
doublet. Both of the doublets acquire a vev. For now let’s assume CP 
conservation and work with in the real-vev basis. Counting degrees of 
freedom, four per complex doublet, and knowing that three modes are 
“eaten” to give the W~ and Z their masses, after SSB there must be five 
physical states. Two of them will necessarily be charged (H=) regardless 
of how we assigned hypercharge to each doublet, leaving the other three 
neutral. Of those, two (h, H) will be CP-even and one will be CP-odd (A), 
the last of which won’t couple to the weak bosons at tree level. The general 
2HDM potential is quite messy [6, 120] , so we’ll not discuss it. 
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Recall the primary role of the Higgs sector: to restore unitarity to weak 
boson scattering. This requires the gauge coupling to WW to be exactly 
igy, where v is what we measure with Gr. In the amplitude, then, 
the coupling squared is gyv’. With two vevs, there is the automatic 
constraint v? + v3 = v? [121]. The ratio is tan 8 = 72. The CP-even mass 


eigenstates, which couple to the weak bosons, thus boil down to simply 
mixing: 


h= V2[-(Red? — v1) sin a + (Reg} — v2) cos a] (8.40) 
H = V2[ (Reg? — v1) cosa + (Reg? — v2) sina] (8.41) 


where a is the angle which diagonalizes the 2 x 2 mixing matrix. The Higgs 
sector is typically defined by a, tan 8 and the potential parameters which 
govern the self-couplings. Some models are defined instead by M4 and Mz. 

Let’s pause for a moment to reflect on what would happen if we in- 
troduced CP violation [119]. This is a well-motivated exercise since there 
isn’t enough CP violation in the SM model to account for baryogenesis 
in the early universe. The most immediate impact is that h, H and A 
now mix. M4 is supposed to parameterize the pseudoscalar pole, but it’s 
now mixed into three physical states, so it becomes ill-defined. Instead, we 
typically use the charged Higgs mass. It would be logical to use M;;+ for 
CP-conserving scenarios as well, but this is one of those historical accidents 
that has too much momentum to change. 

Regarding the fermions, we can apportion the two doublets in four gen- 
eral ways [6]: 


I only ®2 couples to fermions 
II ®; couples to down-type, ®2 to up-type fermions 
III ©, couples to down quarks, ®2 to up quarks and down leptons 


IV ©, couples to quarks, ®2 to leptons 


Types III and IV induce flavor-changing neutral currents (FCNCs), which 
are highly constrained, thus these models are not much studied any more. 
Types I and II are qualitatively different and worth a quick look at the 
differences in their couplings, shown in Table 8.59. Because of which doublet 
gives the down-type fermions their masses, those Yukawa couplings to h and 


4Note that various references use different phase conventions for the Lagrangian. The 
important distinction is the phase between Higgs couplings, and a reference SM cou- 
pling such as eey. I use positive terms in the covariant derivative and drop the overall 
superfluous factor of i typical of most Lagrangians. 
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Table 8.5. Fermion and gauge boson couplings in Type I (upper) 
and II (lower) 2HDMs. 


ð Jbuŭ Jedd JeVV GoZA 
of If IV gV 
cos Q cos @ 1 

h sin a i cos a 
sin 8 sin 8 (6 ) 2 (6 ) 
sina sina 1 

H cos Qa isin Qa 
sin 8 sin 8 (6 ) 2 (6 ) 

A | —iy5 cot 3 iy5 cot 8 0 0) 
cos Q sina 1 

h ——— — sin(G—a —=icos(B—a 
sin 8 cos 8 (8 ) 2 (6 ) 

H sin a cos @ (3 ) ie. (3 ) 

— — cos(G—a -isin(G—-—a 

sin 8 cos 3 2 

A | —iy5 cot 8 | —iy5 tan 8 0 0 


H are swapped between models, with a phase factor from mixing. Similarly, 
the Af f coupling is inverted and changes sign: cot 8 —> — tan 3. The gauge 
coupling for h and H, of course, are unaffected by the Yukawa couplings 
and are fixed to sin(@ — a) and cos(8 — a). (The sum of their squares in 
the amplitude must equal 1!) 

The charged Higgs Yukawa couplings are slightly different yet. The left- 
handed coupling is proportional to the up-type Yukawa coupling, and the 
right-handed coupling the down-type Yukawa, for an out-flowing H~. The 
reverse is true for an outflowing H*. We have: 


GuH-pt = ATA [mu cot B(1 + ys) — mp cot B(1 — 5)] (8.42) 


GH-pu = Sang Lite cot BUA +5) + mp tan p — 95)] (8.43) 


where H~ flows out, D is incoming and U is outgoing. 


8.4.3. Type II 2HDM in the MSSM 


At this point we should focus on the Type I] 2HDM, because that’s the 
one required to appear in the MSSM” (see Ref. [123] for a detailed descrip- 
tion). Model I will have similar features, modulo the couplings swaps given 
in Table 8.5, so is understandable by analogy. We’ll spend the remaining 


TA superpotential can’t be constructed from conjugate fields, else the supersymmetry 
transformations aren’t preserved. For an excellent SUSY tutorial, see Ref. [122]. 
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portion discussing only SUSY Higgs phenomenology, and specifically min- 
imal SUSY, the MSSM. However, by the end it should be apparent that 
extended Higgs sectors may often be treated as variations on a theme, with 
much of the phenomenology based on the same collider signatures. 

The MSSM imposes tree-level constraints on the Higgs potential which 
require the various À to be gauge parameters (MSSM extensions add non- 
gauge terms). We’ll come back to what the potential looks like in Sec. 8.4.5 
and study its phenomenology, and for now simply examine the implication 
of this structure on the mass spectrum. Because we consider only the 
CP-conserving case here, we can get away with using Ma as an input. 
The others will be tan@ as discussed before, the average top squark mass 
Ms, and an encoded trilinear mixing parameter for the top sector, X+. 
This last one is important because of the large top Yukawa corrections 
the MSSM Higgs sector receives. The values 0 and v6 Msg are referred to 
as “no mixing” and “maximal mixing”, because they extremize the loop 
corrections. The h — H mixing angle is 
a + Mż A 


a= 


1 
a= 5 tan! an 26 
to first order. The CP-even masses are given by: 
1 
M,a = 5 (MA + M3 + y (M3 + M3)? + 4M3 M3 sin®(28)) 


3 tog M3 , X2 x? 
— cos? Q m? O ETON = 
872 tarifi r M3\ 2M? 


)| for Mp only 
(8.45) 


where the top Yukawa correction can be significant, a couple tens of GeV. 
The charged Higgs mass is rather more simple: 


Mis = M4 + My (8.46) 


These equations exhibit the interesting property of h decoupling with in- 
creasing pseudoscalar mass: for large Ma the heavy states H, A and H* 
tend to be closely degenerate, and the light h has an asymptotic maximum 
mass which depends mostly on tan 8. We see this behavior, along with a 
plateau effect for Mp and My, in Fig. 8.57. There is always at least one 
CP-even Higgs boson in the mass region 90 S My < 145 GeV, assuming 
perturbativity to high scales. For large M4, toward the decoupling region, 
it is the lighter state, h, but at low Ma it is the heavier state, H. The 
transition region is sharper for larger tan 8. 
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MSSM Higgs boson masses as a function of pseudoscalar mass MA and two 
choices of tan 8, for no (left) and maximal (right) mixing (X; parameter; see text). 
Figures from Ref. [7]. 
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Fig. 8.58. MSSM CP-even Higgs boson couplings to the weak gauge bosons as a function 


of M 4 and for two choices of tan 8, and for no mixing (darker colors) and maximal mixing 
(lighter colors). Figures from Ref. [7]. 


The mass spectrum is not the only feature to exhibit the decoupling 


and transition behavior, however. Both the gauge and Yukawa couplings 
do the same. The VV@ couplings are shown in Fig. 8.58. By comparison 
with Fig. 8.57, we easily see that when either h or H is in its plateau mass 
region, it holds most of the gauge coupling; sin(8—a) — 1 or cos(G—a) > 1. 
In the transition region, the two states share the gauge coupling, and both 


are of comparable importance in unitarity cancellation. As with the mass 
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spectrum, and by now as anticipated, the transition region is sharper for 
larger tan 3. Hold these two figures in your mind, as they are going to play 
an extremely important phenomenological role shortly. 

Using just trigonometry, let’s rewrite the Yukawa couplings of Table 8.5 
to see better how they depend on My and tan 8: 


Jhun = 2o Y, = —[sin(8 — a) + cot 8 cos( 8 — a)| Yu 
sin 8 
sin q : 
Shad = Yq = —[sin(@ — a) — tan 8 cos( 8 — a)] Ya 
cos 3 
sin q (8.47) 
GHua = ET Y„ = —[cos(G — a) — cot 8 sin(8 — a)] Yu 
COS & 
== Ya = -— —a)+t i —a)|Y 
JHad a a cos(3 — a) + tan Gsin(Z — a)| Ya 
This is a far more convenient form, since tan is an input and sin(8 — 


a)/cos(3— a) is the reduced h/H gauge coupling. These are both natural, 
convenient parameters to describe production cross sections and decay par- 
tial widths (thus branching ratios), rather than the CP-even mixing angle 
and sin 2 or cos @, or their inverses. Check Fig. 8.59 to see if you agree. 

These are the most salient features of the MSSM Higgs sector, sufficient 
to understand the bulk of MSSM Higgs phenomenology. For a more in- 
depth discussion, especially of why SUSY imposes these constraints, and 
for more detailed formulae, see Refs. [7, 123]. 

Now that we know the couplings, we can obtain cross sections for h and 
H production simply as correction factors to the SM channels of equal mass. 
There is no WBF or W/Z-associated pseudoscalar production, but there 
is both gg — A inclusive and top quark associated production, ttA, which 
are easily obtained if one inserts the y5 factor into the loop derivation for 
gg — A [7]. The charged Higgs is a special case as there is no SM analogue; 
we'll discuss this in Sec. 8.4.4 in the context of searches. For the moment, 
let’s examine the neutral states’ branching ratios, just to get an idea of how 
they behave. It’s easy to suffer plot overload about now, so don’t try to 
absorb every last detail; focus on the general behavior, which you already 
should be able to guess from the couplings plots. 

Figure 8.60 shows the BRs for the CP-even states h and H, cut off at 
the mass plateaus. They’re basically what we would expect: both h and H 
behave like a SM Higgs of equal mass, except that the various couplings are 
dialed up or down. Mp can never be above ~ 145 GeV, so it almost never 
has a significant BR to gauge bosons. Because the fermionic partial widths 


516 D. Rainwater 


aT 
Son. 


2 
Ihuu 


001E 


F 
tan 8 = 3 ----- E tan 8 = 3 -----. 


tan 8 = 30 ----- J C tang = 30 --- 


0.00 = f Iı ı 1 000 1 I ı Mii 
50 100 150 200 300 500 50 100 150 200 300 500 


Mg [GeV] Ma 


1000 


2 
100 IHdd 


2 
Ihdd 


aaie ee e 


SE teen, 10 


robe 


. ji ji | inf 
50 100 150 200 300 500 50 100 150 200 300 500 
M4 [GeV] M4 [GeV] 


Fig. 8.59. MSSM CP-even Higgs boson couplings to fermions as a function of M4 and 
for two choices of tan 8, and for no mixing (darker colors) and maximal mixing (lighter 
colors). Figures from Ref. [7]. 


can be enhanced by a factor of tan? 3, the rare modes like ¢ > yy, gg tend 
to be suppressed (and the top quark loop can better cancel the W loop 
for some parameter choices, suppressing the partial width). The only new 
features are H — hh, AA decays, possible for limited parameter choices but 
making for interesting additional channels. 

The pseudoscalar BRs behave similarly, as shown in Fig. 8.61. The new 
feature here is at small tan 8, where decays A — hZ are possible. But 
otherwise A prefers to decay ~ 90% to bb and ~ 10% to r*+7~, unless it is 
heavy enough to produce top quark pairs. That dominates only at small 
tan 3 (large cot 3), where the up-type coupling dominates. At large tan ĝ, 
bb and ++7~ both still win by a considerable margin. 

There are similar plots for H~, but they’re not particularly enlightening 
as its decay patterns are drastically simpler: as far as phenomenology is 
concerned, it’s BR~ 1 to tb when kinematically accessible, rv if lighter. 
For low tan 8 there is a rare BR to hW~, but that is predicted to always 
be difficult to observe. 
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Fig. 8.60. MSSM CP-even Higgs boson branching ratios as a function of M, for tan 8 = 
3, 30. Figures from Ref. [7]. 
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Fig. 8.61. MSSM CP-odd Higgs boson branching ratios as a function of M4 for two 
choices of tan 8. Figures from Ref. [7]. 
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All Higgs bosons can decay to SUSY particle pairs if they’re light 
enough, but this is not a very common occurrence across parameter space 
(especially since so much of it is ruled out already by LEP SUSY searches), 
so we'll bypass that discussion here. 


8.4.4. MSSM Higgs searches 


For MSSM Higgs searches past, we start again with LEP. It didn’t find any- 
thing, but placed various limits. Let’s begin with the charged Higgs search, 
because it’s the simplest. This proceeded via H+H- pair production (the 
only mechanism accessible at LEP) and decay to rv or cs, as there was 
never kinematic room for tb. Thus, the search had three channels: dual 
taus, mixed tau plus hadronic decays, and an all-hadronic mode [124]. Be- 
cause the production mechanism depends on only gauge-fixed couplings, 
the MSSM charged Higgs search is usually presented as a more general 
2HDM search, with limits presented in the My+ v. BR(H* — 7*v) plane. 
Fig. 8.62 summarizes the obtained limits. To translate the general search 
limits to the MSSM Higgs sector inputs, recall Eq. 8.46, M2 = M3 +Mĝ. 
The difficulty of this search was the low ID efficiency for taus and charm 
quarks. Unfortunately, there is no final combined limit, but each of the 
collaborations has published final independent limits [126-129]. Watch the 
LEP-Higgs web page for updates [130]. 

The basic neutral Higgs boson search channels are exactly the same as 
in the SM for each of h and H, to which we add ete~ — Z* — hA/HA 
production via the additional couplings of Table 8.5. Each of the four 
LEP collaborations presented a multitude of MSSM h/H/A search limits, 
and there are combined LEP results with CP-conservation [131] and CP- 
violation (CPX) [131, 132]. However, one should be somewhat wary of what 
precisely is presented. The results are usually shown as shaded exclusion 
blobs in either M4-tan 8 space (for a very specific set of additional assump- 
tions) or M;,,-tan 8 space, also given some assumptions. There are literally 


dozens of pages of exclusion plots, depending on what one chooses for the 
mixing parameter X+, top quark mass (recall the strong Mp dependence on 
mz), stop masses, u, and so on. This is far too much to show here, because 
the exclusion contours change so much from assumption to assumption — 
it’s impossible even to select a representative sample without misleading 
the uninitiated. See e.g. Ref. [133]. 

The curious student should flip through the plots in Refs. [131, 133] 
simply to get a feel for how wild this variation is. Observe how much the 
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Fig. 8.62. Left: LEP preliminary combined-experiment charged Higgs search 95%CL 
limits (2001), from Ref. [125]. Right: L3 published limits from 2003, illustrating where 
each of the three decay channels discussed in the text contributes to the overall limit [128]. 
There is no final LEP combined limit, but judging from each of the individual limits [126- 
129], it does not change significantly from the preliminary results. 


contours change depending on the top quark mass — it is obviously still 
fairly poorly measured, as far as fits to supersymmetry go. Note also that 
the plots are always logarithmic in tan 3, which compresses the unexcluded 
large-tan 8 region, making it appear that parameter space is vastly ruled 
out in many cases. This simply isn’t true. Finally, I should comment 
that the “theoretically inaccessible” disallowed blobs are even more grossly 
misleading. All one has to do is move the stop masses up slightly and 
these retreat dramatically. Perhaps a more logical approach is the model- 
independent h/H/A search of OPAL [134]. 

MSSM Higgs Searches at LHC are also mostly variants on the SM search 
channels, the exceptions being charged Higgses, rare (SUSY or Higgs pair) 
decay modes, and one new production channel, bb, which is important at 
large tan 8 where the coupling is enhanced to top-quark Yukawa strength. 
ttġ rates tend to be about the same as the SM for equal mass, or slightly 
suppressed. WBF h or H rates can only be suppressed relative to the SM, 
due to the appearance of sin? (8 — a) or cos?(3 — a), respectively. Inclusive 
rates can change rather dramatically, however, because the b loop can be 
extremely important. Figure 8.63 shows the cross sections for gg — ¢ asa 
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Fig. 8.63. Gluon fusion MSSM Higgs production cross sections at LHC for the CP-even 
states h and H (left) and the pseudoscalar A (right), for two values of tan 8. Figures 
from Ref. [7]. 


function of the physical masses, for small and large tan 8. These may be 
compared with the SM cross sections of Fig. 8.16. 

Let’s concentrate on the WBF modes, however, as they turn out to be 
the most interesting. Recall the plateau behavior of h and H masses as 
a function of Ma (cf. Fig. 8.57), and simultaneously the h and H gauge 
coupling behavior (cf. Fig. 8.58). The astute student will realize that this 
implies that WBF Higgs production in an accessible mass region probably 
always occurs at a good rate, somewhat suppressed but never much so. 
Figure 8.64 summarizes some of this previous information and goes on to 
show the cross section times BR to tau pairs (in the two accessible tau 
decay modes), also as a function of M4 [135]. Indeed, eyeballing the upper 
and lower rows, it appears that between h and H, there’s always a signal 
in WBF. It may be slightly suppressed, but we know from SM WBF Higgs 
studies (cf. Sec. 8.2.3.4) that since so little data is required to make an 
observation, the signal could be suppressed by a factor of several and be 
detectable. The reason is that in the MSSM the h and H plateau mass 
ranges are in the “good” region of WBF Higgs observability. Actually, 
quite a large mass region is observable, but if the MSSM predicted Higgs 
masses closer to the Z pole, there could be trouble (but LEP would already 
have discovered such a Higgs). 

This bit of luck forms the basis of the MSSM Higgs No-Lose Theorem: 
at least one of the CP-even Higgs states, h or H, is guaranteed to be ob- 
servable in WBF at LHC [64, 135]. The original parton-level studies have 
since been confirmed with full ATLAS detector simulation, and actually im- 
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Fig. 8.64. From left to right, each plot as a function of M4: h/H mass, gauge coupling 
suppression factor squared, WBF cross section times BR to taus to the lepton-hadron 
final state, and the same for the dual lepton mode. The upper (lower) row is for h(H). 
Fig. from Ref. [64]. 
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Fig. 8.65. MSSM parameter space coverage of WBF h/H — rtr for the no-mixing 
(A: = 0, left) and maximal mixing (X+ = V6 Msysy, right) cases [135]. 


proved [136]. The parton-level coverage plots shown in Fig. 8.65, however, 
are simpler to grasp. Very little data would be required for discovery, and 
for some M4 it would be possible to observe both h and H simultaneously. 

One caveat: the final state 7*7~ is not always accessible!® It’s pos- 
sible to zero out the MSSM down-type fermion coupling at tree level — 
an interesting exercise for the student. If this happens, h/H — yy and 
h/H — WtW7 are “large” partial widths, so their BRs take up the 
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Fig. 8.66. Feynman diagrams for gg — bbd production at LHC. 


coverage slack [135, 136], saving the No-Lose Theorem. There’s been some 
work on an NMSSM No-Lose Theorem [137-140], which extends the Higgs 
sector by a complex singlet [6]. The outlook for LHC is promising, but not 
obviously rock-solid. 

The No-Lose Theorem is great for the CP-even states, but what about 
the other Higgses? I'll gloss over the bulk of searches, since they’re mostly 
variants on the SM ones, and move on to the special case of heavy H/A 
(towards decoupling) and this new channel bb¢. The Feynman diagrams 
appear in Fig. 8.66. Recall that H has a tan @ enhancement to down-type 
quarks in the decoupling region, and A always has this enhancement. We 
already know that means that H and A prefer to decay 90% of the time 
to bb and 10% to r+7~, but it would be impossible to observe either of 
those final states in inclusive production, and WBF production is zilch for 
H in the decoupling region. However, the LHC being essentially a gluon 
collider, the initial state can create high-energy b pairs, which can then 
Brem a Higgs, either H or A, which are essentially degenerate (but do 
not interfere due to the ys coupling). Since the b jets are produced at 
high-pr, the H/A must recoil against them, so it also produced with a 
transverse boost. It’s decay products are then not back-to-back, allowing 
for tau pair reconstruction; H/A — u™ u~ may also be used, but is a rare 
mode. The final state is then bbr*+7~ (or bbu" u~), which is taggable and 
distinguishable from mixed QCD-EW backgrounds because the tau pair 
invariant mass is in the several-hundred GeV region. 

Figure 8.67 shows the cross section times BR to tau pairs for 300 GeV 
Higgs bosons as a function of tan 3, and also the CMS expected discovery 
reach for various final states in tau or muon pairs, with only 30 fb~! of 
luminosity, or about 1/10 of the total LHC data expected. Coverage is 
not complete, because this mode doesn’t produce enough rate at low tan 8 
where there is little coupling enhancement, but is still a significant search 
tool. The mass resolution achievable for H and A using taus in this mode is 
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Fig. 8.67. Left: bb production cross section at LHC times the BR to tau pairs, as a 
function of tan 8 for M4 = 300 GeV. Right: expected CMS reach using only 30 fb™t of 
data for bbH/A — r+7~, utp" as a function of Ma. Figures from Ref. [141]. 


even pretty good, on the order of a couple tens of GeV, possibly better. Of 
course, if the decay to muons is accessible (at very large tan G, then mass 
resolution would be on the order of a GeV. 

This would determine M4 quite well, good enough for comparison with 
theory (at least at first), but what about the other major Higgs param- 
eter, tan 8? The bbd production rate is directly proportional to tan? 2, 
so we can measure it using the overall rate, with the mild (but not rock 
solid) assumption that the ratio of bb and t+r~ BRs is the ratio of the 
b and 7 squared masses, i.e. that BR(H/A — ttr~)~ 10% [141]. The 
major sources of uncertainty are this assumption, the machine luminos- 
ity uncertainty of 5 — 10%, PDF uncertainties of probably about 5%, and 
higher-order QCD corrections to the production process of probably about 
20% [142, 143]. 

Figure 8.68 shows the CMS expected uncertainty on tan 8 using this 
method, as a function of M4 and for 30 or 60 fb~! of data. In general, 
10 — 20% appears achieveable. This is not spectacular, but would be a sig- 
nificant first step toward sorting out the new Higgs sector and presumably 
comparing to other SUSY discovery measurements. Clearly the higher- 
order QCD uncertainties dominate, which could probably be improved with 
better theoretical calculations over the next decade. This will be done if 
heavy Higgses are discovered. 

Now, what about charged Higgs discovery? We know nothing about its 
phenomenology, because there is no SM analogue. All we do know is the 


524 D. Rainwater 


30, 70, 


Q fe A, = 2450 GeV/ z H Wifi = E Small error bars: stat errors 
æ p = eVe eee susy? T> = p Large error bars: total uncertainty 
c N 2 ERSS inè E =5% 
Z a 2 'W= S00 'GeVie > a ao A, = 2450 GeV/c’, u = 300 GeV/c? Aalsa aok 
S | M200 Gevie Aolo = 20% E My = 200 GeV/c, mysy = 1 TeVic? “BRUBR= 3% 
< PA Musy = 1 TeV/c? ABR/BR = 3% E 
i Total uncertainty cok MOU UL 
E E ME E Ltt 
15 
f wot WUT 
[ Eo 
ele sor.  GELLLLLLLLELELLELETE 
E COO EEEHEENE ey 4 
fa a E OMS, 60 fb 
t Stat errors oe “E ee Hgysy>tteplli/jj 
plete i Te eet ee Cairina een d e nace death 


o 


0 100 200 300 400 500 600 700 800 oe 100 200 300 400 500 600 700 800 Pie 
m,(GeV/c’) m,(GeV/c’) 


Fig. 8.68. CMS expected precision on tan 3 at LHC using bb¢ production as described 
in the text. Figures from Ref. [141]. 


very important fact that, despite everything else we may see at Tevatron 
or LHC, the only way to prove the existence of two Higgs doublets is to 
directly observe the charged Higgs states. I cannot emphasize this enough. 
For all we know, an extra neutral state might simply be the residue of an 
extra Higgs singlet; there could be more to the flavor sector that confuses 
us when we try to measure Yukawa couplings or tan 8. Thus, observing the 
H= states would be a huge qualitative step toward understanding what the 
Higgs sector is. How would this proceed experimentally? 

At Tevatron there is very little energy available for direct charged Higgs 
production, since it must be produced in association with a top quark 
(large coupling), as shown in the Feynman diagrams of Fig. 8.69. However, 
if My+ is small enough, the top quark can decay to bH~ followed by 


TOE ETCE > Tysa 
zg : t ™, b 1 


IOTA I 


Fig. 8.69. Feynman diagrams for charged Higgs production at hadron colliders. The 
short line breaking the b quark propagator represents how the process may also be 
regarded as initiated by a b parton in the proton, rather than from gluon splitting to a 
b quark pair. 
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Fig. 8.70. Branching ratio for top quark to bottom quark plus charged Higgs boson, as 
a function of M;;+ for a few select values of tan 8 (left) and as a function of tan 8 for 
My,;+ = 120 GeV (right). Figures from Ref. [7]. 


H= — rv if tan > 1, and equally to be and cs if tan@ < 1; if Mys Z 
120 GeV, then the BR to W+*bb via a top quark loop becomes significant. 
Figure 8.70 shows the t — bH= BR as a function of My+ for a few select 
tan 9, and as a function of tan 8 for My=+ = 120 GeV. At low tan J, the 
partial width is driven mainly by the top quark Yukawa, while at large 
tan 8 it’s primarily the bottom quark. Weakness of both Yukawas in the 
intermediate-tan @ regime results in a comparatively reduced top quark 
partial width (recall Eqs. (8.42,8.43)). For fixed M7;+, the partial width is 
symmetric in log(tan 8) about a minimum at tan 8 = y m/m. Charged 
Higgs decays to hW= or AW= are generally disallowed in the MSSM from 
LEP mass limits on h and A. 

The Tevatron search proceeds both as appearance (i.e. looking directly 
for H* in the top quark sample) and disappearance, or missing rate for 
top quark to bW=. Figure 8.71 goes on to show the expected 95% CL 
limits in the Mj;+ — tan @ plane that Tevatron Run I achieved, and Run II 
might reach depending on how much data it ultimately records. The very 
slight change between 2 and 10 fb~! reveals that the experiments there are 
statistics-limited, but not by a great margin. 

LHC will search for tH* direct production (Fig. 8.69), covering the 
mass range My+ > m. Due to nasty QCD backgrounds, the tb decay will 
be inaccessible [145], leaving rv with BR~ 10%. This is very difficult due 
to a subtlety of tau decays. Left-handed taus decay to soft leptons [146]. 
Since neutrinos are left-handed, helicity conservation in scalar decay means 
all taus are as well. We need a lepton to trigger the event, and it must come 


526 D. Rainwater 


M(H*) (GeV) 


Fig. 8.71. Tevatron Run I 95% CL charged Higgs mass limits (double hatched lines) as a 
function of tan @ from searches for top quark decays to bottom quark plus charged Higgs, 
and expected limits achievable in Run II (single hatched lines for 2 fb~!, unhatched 
curves for 10 fb~'). Figure from Ref. [144]. 


from H~ instead of t, so that there is only one source of missing transverse 
momentum and we can fully reconstruct t, and H= transversely. Only a 
small fraction of the small rate could pass the necessary detector kinematic 
cuts to be recorded. This limits the search to large tan 8 or small My+, 
where the production rate is largest. Fig. 8.72 shows ATLAS’s expected 
transverse mass distributions for a fairly light and a heavy H=. 

Finally, we come to the overall picture of MSSM Higgs phenomenology 
at LHC. Primarily we’re concerned with discovering all the states, but es- 
pecially the charged Higgs as it’s the key to confirming the existence of 
two Higgs doublets. That turns out to be extraordinarily difficult due to 
a combination of factors, from overwhelming QCD backgrounds to charac- 
teristics of left-handed tau decays. Figure 8.73 summarizes the reach for 
h, H, A and H* [70]. It’s reassuring that the No-Lose Theorem holds 
and we’re guaranteed to find at least one of the CP-even states, h or H. 
However, moderate tan 3 and the decoupling limit (large MA) both present 
significant gaps in coverage to observe any of the additional states. This is 
especially more apparent once one realizes that the region below the solid 
black curve is already excluded by LEP, so those LHC access regions don’t 
matter. The figure is from 2001 and needs updating — some significant 
positive changes exist — but the general picture remains. 
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Fig. 8.72. Expected transverse mass distributions for light and heavy H* — tv at 
ATLAS [147]. 
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8.4.5. MSSM Higgs potential 


I’ve touched on the bits of Higgs gauge and Yukawa couplings in the MSSM 
that are qualitatively different that the SM: My and tan 8. But we should 
look at self-couplings more closely, because in a general 2HDM (or the sub- 
set MSSM) they are radically different. First, because there are more Higgs 
bosons, there are more self-couplings — six for the neutral states alone, to 
be precise: Ahhh, Ahh; AHEh; AHHH, ARAA; AHAA. In the MSSM these 
are all equal to M#?/v times various mixing angles (which aren’t partic- 
ularly enlightening so I don’t show them) plus additional shifts from top 
quark Yukawa loop corrections. That is, they are all (mostly) gauge param- 
eters. However, in the large-M, decoupling limit which recovers the SM, 
Ahhh — ASM- 

If we discover SUSY, we’d start by assuming it’s the MSSM. To mea- 
sure the MSSM potential in that case, we’d have to observe at least six 
different Higgs pair production modes to measure the six self-couplings. 
(Note that I’m leaving out the possible self-couplings involving charged 
Higgses.) Inclusive Higgs pair production looks generally like it does in the 
SM, gg — ¢1¢2 via triangle and box loop diagrams as shown in Fig. 8.74, 
but the b quark loops become important and must be included. 
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Fig. 8.73. Summary of MSSM Higgs boson discovery reaches at LHC (and extended 
to SLHC via the solid red line), combining ATLAS and CMS, in the tan 8 — MA plane 
in the maximal mixing scenario. The reach is defined as 5ø discovery in at least one 
production and decay channel. Below the solid black curve is the region excluded by 
LEP. Figure from Ref. [70]. 


Fig. 8.74. Feynman diagrams for Higgs pair production in a 2HDM like the MSSM. 
The loops include both top and bottom quarks, and there are six possible processes 
(see text). 


Unfortunately, the box diagram totally swamps the one containing the 
self-coupling we care about by a factor tan? 8, and in any case backgrounds 
from H/Abb production appear to be overwhelming [103]: very generally, 
LHC would not obtain any À measurements at all. The one very limited 
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Fig. 8.75. Resonant MSSM Higgs pair production at LHC and decay to bbyy final 
states [103]. 


exception is that LHC could clearly observe Higgs pair production if it 
came from resonant heavy Higgs decay, H/A — hh. An example peak is 
shown in Fig. 8.75. However, this would measure only a BR, at best, not an 
absolute coupling. Sadly, exactly the same situation exists for Higgs pairs 
at a future ILC [104]. 


8.5. Conclusions 


The purpose of these lectures has not been to provide exhaustive coverage 
of all aspects of collider Higgs phenomenology. Rather, it’s a solid intro- 
duction, focusing on the basics. This includes SM production and decay, 
mostly at LHC, where we’re confident we could discovery a SM-like Higgs, 
and many non-SM-like variants. I focused on the most important channels 
which guarantee discovery, and especially in weak boson fusion (WBF) as 
those are the most powerful (best S/B, distinctive) search channels, cover- 
ing the broadest range of Higgs mass. I emphasized that our understanding 
of LHC Higgs physics has changed dramatically from the days of the AT- 
LAS TDR, for example, which is now quite obsolete. However, ATLAS has 
produced a plethora of Notes and summaries of Notes to cover the changes, 
and CMS published a fresh TDR [48] in 2006 which covers the changes 
as well. 

We now understand the LHC to be such a spectacular Higgs factory 
that not only can it discover any mass of SM-like Higgs boson, it can also 
do an impressive job of measuring all its quantum properties. Granted, 
Higgs couplings measurements won’t be precision-level if the Higgs is light, 
as expected from EW precision data, but they would nonetheless be ab- 
solute couplings measurements. The LHC can even make significant steps 


530 D. Rainwater 


toward measuring the SM Higgs potential, at least the Higgs trilinear self- 
coupling, although depending on Mp it may require precision gauge and 
Yukawa couplings input from a future ete~ collider (an ILC) to control 
the major systematic uncertainties. I also highlighted where an ILC could 
make improvements to the LHC’s measurements, and where it would be 
vital to filling in gaps in LHC results. 

The final third of the lectures discussed BSM Higgs sectors, but only 
the 2HDM MSSM Higgs sector in any detail. Many SM Higgs sector ex- 
tensions are rather simple variants on SM phenomenology, involving fac- 
torizable changes in production and decay rates (couplings), mostly arising 
from mixing angles. This is not general, however, and there are plenty 
of “exotic” models — Higgs triplets, for example — which would be qualita- 
tively different, but therefore simultaneously distinctive. The popular focus 
on the MSSM 2HDM is because of several other outstanding questions in 
particle physics, like dark matter or the theoretical dirty laundry of the SM 
Higgs sector, which strongly motivate the other new physics. 

Students who wish to engage in Higgs phenomenology research should 
definitely take the time to expand their scope beyond the SM and the 
MSSM. Other extensions are equally well-motivated, such as Little Higgs, 
not to mention strong dynamics. But the two well-studied basic models I 
covered here give one a strong foundation for other BSM Higgs phenomenol- 
ogy by analogy. Happy Higgs hunting! 
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Chapter 9 


Z’ Phenomenology and the LHC 


Thomas G. Rizzo 


Stanford Linear Accelerator Center, 
2575 Sand Hill Rd., Menlo Park, CA, 94025, 
rizzo@slac.stanford. edu 


A brief pedagogical overview of the phenomenology of Z’ gauge bosons is 
presented. Such particles can arise in various electroweak extensions of 
the Standard Model (SM). We provide a quick survey of a number of Z’ 
models, review the current constraints on the possible properties of a Z’ 
and explore in detail how the LHC may discover and help elucidate the 
nature of these new particles. We provide an overview of the Z’ studies 
that have been performed by both ATLAS and CMS. The role of the 
ILC in determining Z’ properties is also discussed. 


9.1. Introduction: What is a Z’ and What is It Not? 


To an experimenter, a Z’ is a resonance, which is more massive than the SM 
Z, observed in the Drell-Yan process pp(pp) > ItI~ + X, where l=e, and, 
sometimes, 7, at the LHC(or the Tevatron). To a theorist, the production 
mechanism itself tells us that this new particle is neutral, colorless and self- 
adjoint, i.e., it is its own antiparticle. However, such a new state could still 
be interpreted in many different ways. We may classify these possibilities 
according to the spin of the excitation, e.g., a spin-0 Č in R-parity violating 
SUSY, a spin-2 Kaluza-Klein(KK) excitation of the graviton as in the 
Randall-Sundrum(RS) model,” or even a spin-1 KK excitation of a SM 
gauge boson from some extra dimensional model.*° Another possibility 
for the spin-1 case is that this particle is the carrier of a new force, a new 
neutral gauge boson arising from an extension of the SM gauge group, i.e., 
a true Z’, which will be our subject below. Given this discussion it is 
already clear that once a new Z’-like resonance is discovered it will first be 
necessary to measure its spin as quickly as possible to have some idea what 
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kind of new physics we are dealing with. As will be discussed below this 
can be done rather easily with only a few hundred events by measuring the 
dilepton angular distribution in the reconstructed Z’ rest frame. Thus, a 
Z’ is a neutral, colorless, self-adjoint, spin-1 gauge boson that is a carrier 
of a new force.* 

Once found to be a Z’, the next goal of the experimenter will be to 
determine as well as possible the couplings of this new state to the particles 
(mainly fermions) of the SM, i.e., to identify which Z’ it is. As we will see 
there are a huge number of models which predict the existence of a Z’.®® 
Is this new particle one of those or is it something completely new? How 
does it fit into a larger theoretical framework? 


9.2. Z’ Basics 


If our goal is to determine the Z’ couplings to SM fermions, the first ques- 
tion one might ask is ‘How many fermionic couplings does a Z’ have?’ 
Since the Z’ is a color singlet its couplings are color-diagonal. Thus (al- 
lowing for the possibility of light Dirac neutrinos), in general the Z’ will 
have 24 distinct couplings-one for each of the two-component SM fields: 
UL, dLi, VLi, €L; + (L — R) with i = 1 — 3 labeling the three generations. 
(Of course, exotic fermions not present in the SM can also occur but we 
will ignore these for the moment.) For such a generic Z’ these couplings are 
non-universal, i.e., family-dependent and this can result in dangerous flavor 
changing neutral currents(FCNC) in low-energy processes. The constraints 
on such beasts are known to be quite strong from both K —K and Bas—Ba,s 
mixing? as well as from a large number of other low-energy processes. There 
FCNC are generated by fermion mixing which is needed to diagonalize the 
corresponding fermion mass matrix. As an example, consider schemati- 
cally the Z’ coupling to left-handed down-type quarks in the weak basis, 
i.e., dy, nd}, Z', with n; being a set of coupling parameters whose different 
values would represent the generational-dependent couplings. For simplic- 
ity, now let 71,2 = a and 73 = b and make the unitary transformation to 
the physical, mass eigenstate basis, dy, = U;jd,,;. Some algebra leads to 
FCNC couplings of the type ~ (b — a)d,,U},Us;dz, Z’. Given the existing 
experimental constraints, since we expect these mixing matrix elements to 
be of order those in the CKM matrix and a,b to be O(1), the Z’ mass must 
be huge, ~ 100 TeV or more, and outside the reach of the LHC. Thus un- 


4Distinguishing a Z’ from a spin-1 KK excitation is a difficult subject beyond the scope 
of the present discussion.” 
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less there is some special mechanism acting to suppress FCNC it is highly 
likely that a Z’ which is light enough to be observed at the LHC will have 
generation-independent couplings, i.e., now the number of couplings is re- 
duced: 24 — 8 (or 7 if neutrinos are Majorana fields and the RH neutrinos 
are extremely heavy). 

Further constraints on the number of independent couplings arise from 
several sources. First, consider the generator or ‘charge’ to which the Z’ 
couples, T’. Within any given model the group theory nature of T’ will be 
known so that one may ask if [T’, T;] = 0, with T; being the usual SM weak 
isospin generators of SU (2)z. If the answer is in the affirmative, then all 
members of any SM representation can be labeled by a common eigenvalue 
of T’. This means that uz and dz, i.e., QT = (u,d)z, as well as vzr and ez, 
i.e., LT = (v,e)z (and dropping generation labels), will have identical Z’ 
couplings so that the number of independent couplings is now reduced from 
8 — 6(7 — 5). As we will see, this is a rather common occurrence in the 
case of garden-variety Z’ which originate from extended GUT groups® such 
as SO(10) or Eg. Clearly, models which do not satisfy these conditions lead 
to Z’ couplings which are at least partially proportional to the diagonal SM 
isospin generator itself, i.e., T’ = aT} . 

In UV completed theories a further constraint on the Z’ couplings arises 
from the requirement of anomaly cancellation. Anomalies can arise from 
one-loop fermionic triangle graphs with three external gauge boson legs; 
recall that fermions of opposite chirality contribute with opposite signs to 
the relevant ‘VVA’ parts of such graphs. In the SM, the known fermions 
automatically lead to anomaly cancellation in a generation independent 
way when the external gauge fields are those of the SM. The existence of 
the Z’, together with gauge invariance and the existence of gravity, tells 
us that there are 6 new graphs that must also vanish to make the theory 
renormalizable thus leading to 6 more constraints on the couplings of the 
Z’. For example, the graph with an external Z’ and 2 gluons tells us that 
the sum over the colored fermion’s eigenvalues of T’ must vanish. We can 
write these 6 constraints as (remembering to flip signs for RH fields) 


Dy nS De ee 


colortriplets,i isodoublets,i 


Syn) YTP =0 (9.1) 
SX T=) T =0, 
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where here we are summing over various fermion representations. These 6 
constraints can be quite restrictive, e.g., if T” Æ aTr + bY, then even in 
the simplest Z’ model, vr (not present in the SM!) must exist to allow for 
anomaly cancellation. More generally, one finds that the existence of new 
gauge bosons will also require the existence of other new, vector-like (with 
respect to the SM gauge group) fermions to cancel anomalies, something 
which happens automatically in the case of extended GUT groups. It is 
natural in such scenarios that the masses of these new fermions are compa- 
rable to that of the Z’ itself so that they may also occur as decay products 
of the Z’ thus modifying the various Z’ branching fractions. If these modes 
are present then there are more coupling parameters to be determined. 


9.3. Z-Z’ Mixing 


In a general theory the Z’ and the SM Z are not true mass eigenstates due to 
mixing; in principle, this mixing can arise from two different mechanisms. 

In the case where the new gauge group G is a simple new U(1)’, the 
most general set of SU (2)z x U(1)y x U(1)’ kinetic terms in the original 
weak basis (here denoted by tilded fields) is 

1 a v ls PHV Ls, FIV sin X 5, PUV 

Eran WaW = 2B BY — A E BY, (9.2) 
where sinx is a parameter. Here W/ is the usual SU(2); gauge field 
while Bu, Zi, are those for U(1)y and U(1)’, respectively. Such gauge ki- 
netic mixing terms can be induced (if not already present) at the one- 
loop level if Tr(T’Y) # 0. Note that if G were a nonabelian group 
then no such mixed terms would be allowed by gauge invariance. In 
this basis the fermion couplings to the gauge fields can be schemati- 
cally written as f(gLTaW® + gyYB + §zT'Z')f. To go to the phys- 
ical basis, we make the linear transformations B —> B — tan xZ’ and 
Ž! — Z'/cosy which diagonalizes £x and leads to the modified fermion 
couplings f|gL Ta W° +gy Y B+ gz (T’ + 6Y)Z'|f where gz = gz: / cos x and 
ô = —gy tan y/gz. Here we see that the Z’ picks up an additional coupling 
proportional to the usual weak hypercharge. 6 Æ 0 symbolizes this gauge 
kinetic mixing!? and provides a window for its experimental observation. 
In a GUT framework, being a running parameter, 6(Meur) = 0, but can 
it can become non-zero via RGE running at lower mass scales if the low 


energy sector contains matter in incomplete GUT representations. In most 
models’? where this happens, |6(~ TeV)| < 1/2. 
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Z-Z' mixing can also occur through the conventional Higgs-induced SSB 
mechanism (7.e., mass mixing) if the usual Higgs doublet(s), H; (with vevs 
Up; ), are not singlets under the new gauge group G. In general, the breaking 
of G requires the introduction of SM singlet Higgs fields, S; (with vevs vs, ). 
These singlet vevs should be about an order of magnitude larger than the 
typical doublet vevs since a Z’ has not yet been observed. As usual the 
Higgs kinetic terms will generate the W, Z and Z’ masses which for the 
neutral fields look like 


2 
5 (2 T3LZ + oT’) v»,| + X [gz T'vs, Z]? $ (9.3) 
i a j 
where Cw = cosOw. (Note that the massless photon has already been 
‘removed’ from this discussion.) The square of the first term in the first 
sum produces the square of the usual SM Z boson mass term, ~ M?Z?. 
The square of the last term in this sum plus the square of the second sum 
produces the corresponding Z’ mass term, ~ M2,Z'?. However, the ZZ’ 
interference piece in the first sum leads to Z-Z’ mixing provided T’H; 4 0 
for at least one 7; note that the scale of this cross term is set by the doublet 
vevs and hence is of order ~ M3. 

This analysis can be summarized by noting that the interaction above 
actually generates a mass (squared) matrix in the ZZ’ basis: 


M2 M2 

M? = a PME, (9.4) 
6M2 M}? 

Note that the symmetry breaking dependent parameter p, 


4Cwgz 
g= tse [E nereh, I Soe, (9.5) 
JL i i 


can be argued to be O(1) or less on rather general grounds. Since this 
matrix is real, the diagonalization of M? proceeds via a simple rotation 
through a mixing angle ¢, i.e., by writing Z = Z,cos¢@ — Zosin®g, etc, 
which yields the mass eigenstates Z1,2 with masses Mj 2; given present 
data we may expect r = M?/M2 < 0.01 — 0.02. Zı ~ Z is the state 
presently produced at colliders, i.e., Mı = 91.1875+ 0.0021 GeV, and thus 
we might also expect that ¢ must be quite small for the SM to work as well 
as it does. Defining p = M?/M?, with Mz being the would-be mass of the 
Z if no mixing occurred, we can approximate 


o = —6r[1+ (1+ B?)r + O(r?)] (9.6) 
õp = Br[1 + (1+ 267)r + O(r?)], 
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where dp = p — 1, so that 8 determines the sign of ¢. We thus expect that 
both dp, || < 107?. In fact, if we are not dealing with issues associated 
with precision measurements!! then Z-Z’ mixing is expected to be so small 
that it can be safely neglected. 

It is important to note that non-zero mixing modifies the predicted SM 
Z couplings to (Tsz — twQ)cg + gzT’s¢, where tw = sin? 0w, which 
can lead to many important effects. For example, the partial width for 
Z, — ff to lowest order (i.e., apart from phase space, QCD and QED 


radiative corrections) is now given by 


pGr M? (visp + azp) 


I'(Z, > ff) =Ne 9.7 
where Ne is a color factor, p is given above and 
92! , 1 
vef = (ToL — 2rwQ)ce + ———~(T, + Tr)s 9.8 
ff ( ) o aa. L R) $ ( ) 
gz' , , 
aeft = T3106 + — == (T; — Th)so , 
ff $ gJL/(2cw) L R}? ¢ 


and where T; p are the eigenvalues of T’ for fz,r. Other effects that can 
occur include decay modes such as Zə — WtW~,Z,H;, where H; is a 
light Higgs, which are now induced via mixing. If T’ has no T; component 
this is the only way such decays can occur at tree level. In the case of 
the Z2 — WtW~— mode, an interesting cancellation occurs: the partial 
width scales as s3 (M2 /Mw)*, where the second factor follows from the 
Goldstone Boson Equivalence Theorem.!? However, since sọ ~ —6r and 
r = M?/M3 ~ M2/M3, we find instead that the partial width goes as 
~ 8B? without any additional mass enhancement or suppression factors. 
The tiny mixing angle induced by small r has been offset by the large 
Mə/Mw ratio! In specific models, one finds that this small Z-Z’ mixing 
leads to Z2 = WtW- partial widths which can be comparable to other 
decay modes. Of course, Z2 ~ W*W can be also be induced at the one- 
loop level but there the amplitude will be suppressed by the corresponding 
loop factor as well as possible small mass ratios. 


9.4. Some Sample Z’ Models 


There are many (hundreds of) models on the market which predict a Z’ 
falling into two rather broad categories depending on whether or not they 
arise in a GUT scenario. The list below is only meant to be representative 
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and is very far from exhaustive and I beg pardon if your favorite model is 
not represented. 

The two most popular GUT scenarios are the Left Right Symmetric 
Model (LRM)? and those that come from E¢ grand unification.’ 

(i) In the Eg case one imagines a symmetry breaking pattern Eg —> 
SO(10) x U(1)y => SU(5) x UA), x UA) y. Then SU(5) breaks to the SM 
and only one linear combination G = U(1)9 = coU (1)y — soU (1)y remains 
light at the TeV scale. @ is treated as a free parameter” and the partic- 
ular values 0 = 0, — 90°, sin~! ,/(3/8) ~ 37.76° and —sin~! ,/(5/8) ~ 
—52.24°, correspond to ‘special’ models called Y, x,7 and I, respectively. 
These models are sometimes referred to in the literature as effective rank-5 
models (ER5M). In this case, neglecting possible kinetic mixing, 


gL [Sew ( Quco 2) 
gee N pals = l 9.9 
4 Cw V 3 ( 2v6 2/10 (98) 


where A œ 1 arises from RGE evolution. The parameters Qy, originate 
from the embeddings of the SM fermions into the fundamental 27 repre- 
sentation of Es. A detailed list of their values can be found in the second 
paper in® with an abbreviated version given in the Table below in LH field 
notation. Note that this is the standard form for this embedding and there 
are other possibilities. These other choices can be recovered by a shift in 
the parameter 0. Note further that in addition to the SM fermions plus the 
RH neutrino, Eg predicts, per generation, an additional neutral singlet, S°, 
along with an electric charge Q = —1/3, color triplet, vector-like isosinglet, 
h, and a color singlet, vector-like isodoublet whose top member has Q = 0, 
H (along with their conjugate fields). These exotic fermions with masses 
comparable to the Z’ cancel the anomalies in the theory and can lead to 
interesting new phenomenology® but we will generally ignore them in our 
discussion below. In many cases these states are quite heavy and thus will 
not participate in Z’ decays. 

(ii) The LRM, based on the low-energy gauge group SU(2);, x SU(2) rx 
U(1)p_1, can arise from an SO(10) or Eg GUT. Unlike the case of ER5M, 
not only is there a Z’ but there is also a new charged WẸ gauge boson since 
here G = SU(2). In general x = gr/gL # 1 is a free parameter but must 
be > «w/(1—«w) for the existence of real gauge couplings. On occasions, 
the parameter arr = \/c2,K?/a7, — 1 is also often used. In this case we 


bThe reader should be aware that there are several different definitions of this mixing 
angle in the literature, i.e., Z! = Zy cos 8 + zy sin 8 occurs quite commonly. 
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Table 9.1. Quantum numbers 
for various SM and exotic 
fermions in LH notation in Ee 
models 


Representation Qy Qx 


Q 1 —1 
L 1 3 
us 1 —1 
d° 1 3 
ef 1 —1 
vt 1 —5 
H —2 -2 
H° —2 2 
h —2 2 
he —2 -2 
S° 4 0 


find that 


guT' = Zk? (1 +6? )ew]? [ewTan +K?(1—-rw)Tsr-zwQ]. (9.10) 


The mass ratio of the W’ and Z’ is given by 


BEE a, SE SO (9.11) 

Mg,  Kk?(1-— zw)-— zw 
with the values pr = 1(2) depending upon whether SU(2)pr is broken by 
either Higgs doublets (or by triplets). The existence of a W” = Wr with 
the correct mass ratio to the Z’ provides a good test of this model. Note 
that due to the LR symmetry we need not introduce additional fermions in 
this model to cancel anomalies although right-handed neutrinos are present 
automatically. In the Eg case a variant of this model'* can be constructed 
by altering the embeddings of the SM and exotic fermions into the ordinary 
10 and 5 representations (called the Alternative LRM, i.e., ALRM). 

(iii) The Z’ in the Little Higgs scenario!’ provides the best non-GUT ex- 
ample. The new particles in these models, i.e., new gauge bosons, fermions 
and Higgs, are necessary to remove at one-loop the quadratic divergence 
of the SM Higgs mass and their natures are dictated by the detailed group 
structure of the particular model. This greatly restricts the possible cou- 
plings of such states. With a W’ which is essentially degenerate in mass 
with the Z’, the Z’ is found to couple like gy T’ = (g,/2)T3r cot 04, with 
0y another mixing parameter. 
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(iv) Another non-GUT example!” is based on the group SU(2); x 
SU(2), x U(1)y with l,h referring to ‘light’ and ‘heavy’. The first 2 
generations couple to SU(2); while the third couples to SU(2),. In this 
case the Z’ and W’ are again found to be degenerate and the Z’ couples to 
gz T” = gr|cot ®T3;-tan PT3] with ® another mixing angle. Such a model 
is a good example of where the Z’ couplings are generation dependent. 

(v) A final example is a Z’ that has couplings which are exactly the 
same as those of the SM Z (SSM), but is just heavier. This is not a real 
model but is very commonly used as a ‘standard candle’ in experimental 
Z’ searches. A more realistic variant of this model is one in which a Z’ 
has no couplings to SM fermions in the weak basis but the couplings are 
then induced in the mass eigenstate basis Z-Z’ via mixing. In this case the 
relevant couplings of the Z’ are those of the SM Z but scaled down by a 
factor of sin ¢. 

A nice way to consider rather broad classes of Z’ models has recently 
been described by Carena et al..'® In this approach one first augments the 
SM fermion spectrum by adding to it a pair of vector-like (with respect 
to the SM) fermions, one transforming like L and the other like d°; this is 
essentially what happens in the kg GUT model. The authors then look for 
families of models that satisfy the six anomaly constraints with generation- 
independent couplings. Such an analysis yields several sets of 1-parameter 
solutions for the generator T” but leaves the coupling gz free. The simplest 
such solution is T’ = B— zL, with x a free par meter. Some other solutions 
include T” = Q + zur (i.e., T’(Q) = 1/3 and T’(ur) = x«/3 and all others 
fixed by anomaly cancellation), T” = dp — xur and T’ = 10 + x5, where 
‘10’ and 5 refer to SU(5) GUT assignments. 


9.5. What Do We Know Now? Present Z’ Constraints 


Z' searches are of two kinds: indirect and direct. Important constraints 
arise from both sources at the present moment though this is likely to 
change radically in the near future. 


9.5.1. Indirect Z’ Searches 


In this case one looks for deviations from the SM that might be associ- 
ated with the existence of a Z’; this usually involves precision electroweak 
measurements at, below and above the Z-pole. The cross section and for- 
ward backward asymmetry, Arg, measurements at LEPII take place at 
high center of mass energies which are still (far) below the actual Z’ mass. 
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Since such constraints are indirect, one can generalize from the case of 
a new Z’ and consider a more encompassing framework based on contact 
interactions.!® Here one ‘integrates out’ the new physics (since we assume 
we are at energies below which the new physics is directly manifest) and 
express its influence via higher-dimensional (usually dim-6) operators. For 


+ 


example, in the dim-6 case, for the process ete” — ff, we can consider an 


effective Lagrangian of the form!’ 


An 
L = Lsm + mG 


A2(1 + des) S hy Gees) fr" fs) (9.12) 


ij=L,R 

where A is called ‘the compositeness scale’ for historic reasons, def takes care 
of the statistics in the case of Bhabha scattering, and the 7’s are chirality 
structure coefficients which are of order unity. The exchange of many new 
states can be described in this way and can be analyzed simultaneously. 
The corresponding parameter bounds can then be interpreted within your 
favorite model. This prescription can be used for data at all energies as 
long as these energies are far below A. 

Z-pole measurements mainly restrict the Z-Z’ mixing angle as they are 
sensitive to small mixing-induced deviations in the SM couplings and not 
to the Z’ mass. LEP and SLD have made very precise measurements of 
these couplings which can be compared to SM predictions including radia- 
tive corrections.1' An example of this is found in Fig. 9.1 where we see 
the experimental results for the leptonic partial width of the Z as well as 
sin? Olepton in comparison with the corresponding SM predictions. Devia- 
tions in sin? lepton are particularly sensitive to shifts in the Z couplings due 
to non-zero values of ¢. Semiquantitatively these measurements strongly 
suggest that |¢| < a few 1073, at most, in most Z’ models assuming a light 
Higgs. Performing a global fit to the full electroweak data set, as given, 
e.g., by the LEPEWWG!"! gives comparable constraints.® 

Above the Z pole, LEPII data provides strong constraints on Z’ cou- 
plings and masses but are generally insensitive to small Z-Z’ mixing. Writ- 
ing the couplings as >, fy (uf —af,75)fZ? for i = 7, Z,Z’, the differential 
cross section for ete~ — ff when my = 0 is just 


do Ne 3 
dz 32ns > Pij lB (l + 2°) + 2Cy2], (9.13) 


where 


Bij = (vivg + aiaj)e(vivj + aiaj)p (0.14) 


Cig = (viaj + aivj)e(viaj + aivy)y , 
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Measurement Fit pioen 
Ao (m,) 0.02758 + 0.00035 0.02766 |m 
mz [GeV] 91.1875+0.0021 91.1874 
T,[GeV] 2.4952+0.0023 2.4957 = 
0.233 A i : Crag [nb] 41.5404 0.037 41.477 pm 
B m= 171.4 + 2.1 GeV R, 20.767 +0.025 20.744 pamm 
my=114..1000GeV 4 ao 0.01714 + 0.00095 0.01640 mmm 
AP.) 0.1465 + 0.0032 0.1479 jm 
Ry 0.21629 + 0.00066 0.21585 pmm 
k 0.1721 +0.0030 0.1722 
a AY 0.0992+ 0.0016 0.1037 > 
fee Ane 0.0707 + 0.0035 0.0741 pamm 
aq A, 0.923 + 0.020 0.935 pm 
5 A, 0.670 + 0.027 0.668 
A(SLD) 0.1513+0.0021 0.14797 pmm 
sin?esr(Q,,) 0.2824+0.0012 0.2314 pmm 
my [GeV]  80.392+ 0.029 80.371 pmm 
0.231 Tw[GeV] 2.147 + 0.060 2.091 pmm 
m, [GeV] 171.4+2.1 171.7 M 
0 1 2 3 


83.6 838 84 84.2 
T, [MeV] 


(a) (b) 


Fig. 9.1. Summer 2006 results from the LEPEWWG. (a) Fit for the Z leptonic par- 
tial width and sin? Olepton in comparison to the SM prediction in the yellow band. 
(b) Comparison of a number of electroweak measurements with their SM fitted values. 


and 


(s — M?)(s — M?) + iT ;MiM; 


Py = 9° oe 
-MP +e 


(9.15) 


with ys the collision energy, T; being the total widths of the exchanged 
particles and z = cos 0, the scattering angle in the CM frame. Arg for any 
final state fermion f is then given by the ratio of integrals 


i dz% -= f°, dz% 


o (9.16) 


If the e~ beams are polarized (as at the ILC but not at LEP) one can also 
define the left-right polarization asymmetry, Af r; to this end we let 


Bi; > Bij E(viaj t A4V;)e(UiV; + AjQ;) f (9 17) 


Cig > Cag + €(vivg + ais e(viag + a5) 5 , 
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and then form the ratio 


do(€ = +1) — do(€ = —1) 


f = 
Azr(2) Sp "n + " 


: (9.18) 
where P is the effective beam polarization. 

For a given Z’ mass and couplings the deviations from the SM can then 
be calculated and compared with data; since no obvious deviations from 
the SM were observed, LEPII" places 95% CL lower bounds on Z’ masses 
of 673(481, 434, 804, 1787) GeV for the x(w,7,LRM(« = 1),SSM) models 
assuming \ = 1. Note that since we are far away from the Z’ pole these 
results are not sensitive to any particular assumed values for the Z’ width 
as long as it is not too large. 

The process ete~ — WtW- can also be sensitive to the existence of a 
Z’, in particular, in the case where there is some substantial Z-Z’ mixing.?° 
The main reason for this is the well-known gauge cancellations among the 
SM amplitudes that maintains unitarity for this process as the center of 
mass energy increases. The introduction of a Z’ with Z-Z’ mixing induces 
tiny shifts in the W couplings that modifies these cancellations to some 
extent and unitarity is not completely restored until energies beyond the 
Z’ mass are exceeded. As shown by the first authors in Ref. 20, the leading 
effects from Z-Z’ mixing can be expressed in terms of two s—dependent 
anomalous couplings for the WW y and WW Z vertices, i.e., gwwy = e(1 + 
ô) and gwwz = e(cot 0w +6z) and inserting them into the SM amplitude 
expressions. The parameters ô, z are sensitive to the Z’ mass, its leptonic 
couplings, as well as the Z-Z’ mixing angle. In principle, the constraints on 
anomalous couplings from precision measurements can be used to bound 
the Z’ parameters in a model dependent way. However, the current data 
from LEPII" is not precise enough to get meaningful bounds. More precise 
data will, of course, be obtained at both the LHC and ILC. 

The measurement of the W mass itself can also provides a constraint 
on 6p since the predicted W mass is altered by the fact that Mz 4 Mz. 
Some algebra shows that the resulting mass shift is expected to be My = 
57.67.25 MeV. Given that My is within œ~ 30 MeV of the predicted SM 
value and the current size of theory uncertainties,?! strongly suggests that 
bp < a few 10-3 assuming a light Higgs. This is evidence of small r and/or 
GB if a Z’ is actually present. 

Below the Z pole many low energy experiments are sensitive to a Z’. 
Here we give only two examples: (i) The E-158 Polarized Moller scat- 
tering experiment’? essentially measures Ayr which is proportional to a 
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coupling combination ~ —1/2 + 2xefp where xef = tw+t‘new physics’. 
Here xy is the running value of sin? 0w at low Q? which is reliable cal- 
culable. For a Z’ (assuming no mixing) the ‘new physics’ piece is just 


Wai ae au, al, which can be determined in your favorite model. Given the 


data,?? rere — cw = 0.0016 0.0014, one finds, e.g., that Mz, > 960A GeV 
at 90% CL. (ii) Atomic Parity Violation(APV) in heavy atoms measures 
the effective parity violating interaction between electrons and the nucleus 
and is parameterized via the ‘weak charge’, Qw, which is again calculable 
in your favorite model: 


M2 
= ey, Aa i[vu,(2Z + N) + va,(2N + Z)], (9.19) 


= —N + Z(1 — 4rw)+ a Z’ piece, in the limit of no mixing; here the sum 
extends over all neutral gauge bosons. The possible shift, AQw, from the 
SM prediction then constrains Z’ parameters. The highest precision mea- 
surements from Cs!8° yield?’ AQw = 0.45 + 0.48 which then imply (at 
95% CL) Mz, > 1.05\ TeV and Mz,,,,, > 0.67 TeV for x = 1. Note that 
though both these measurements take place at very low energies, their rel- 
ative cleanliness and high precision allows us to probe TeV scale Z’ masses. 
Figure 9.2 shows the predicted value of the running sin” 0w? together with 
the experimental results obtained from E-158, APV and NuTeV.*4 The ap- 
parent ~ 30 deviation in the NuTeV result remains controversial but is at 
the moment usually ascribed to our lack of detailed knowledge of, e.g., the 
strange quark parton densities and not to new physics. 


9.5.2. Direct Z’ Searches 


In this case, we rely on the Drell-Yan process at the Tevatron as mentioned 
above. The present lack of any signal with an integrated luminosity ap- 
proaching ~ 1 fb~! allows one to place a model-dependent lower bound on 
the mass of any Z’. The process pp — ItI~ + X at leading order arises 
from the parton-level subprocess gq — I*I~ which is quite similar to the 
ete — ff reaction discussed above. The cross section for the inclusive 
process is described by 4 variables: the collider CM energy, ys, the invari- 
ant mass of the lepton pair, M, the scattering angle between the q and 
the 17, 0*, and the lepton rapidity in the lab frame, y, which depends on 
its energy (E) and longitudinal momentum(p,): y = $ log ES tel, For a 
massless particle, this is the same as the SIEN n. With these 
variables the triple differential cross section for the Drell-Yan process is 


550 Thomas G. Rizzo 


sino (Q) 


0.24 


Czarnecki 
Marciano 
(2000) 


Qw(Cs) 


PDG2004 


10 1 10 10 


10° 
Q (GeV) 


Fig. 9.2. A comparison by E-158 of the predictions for the running value of sin? 0w 
with the results of several experiments as discussed in the text. 


given by (z = cos 0*) 
do _ K(M) 
dM dy dz  48nxM3 


y [seta FAJFA (9.20) 
q 


where K is a numerical factor that accounts for NLO and NNLO QCD 
corrections? as well as leading electroweak corrections?’ and is roughly of 
order ~ 1.3 for suitably defined couplings, 


Gy = LaTbld(Ta, M? )g(2», M?) £ q(x», M?)G(2a, M?)| ’ (9.21) 


are products of the appropriate parton distribution functions (PDFs), with 
Za bp = Me*"/,/s being the relevant momentum fractions, which are evalu- 
ated at the scale M? and 


Sq = X Pij(s > M)Bis(f > 9) 
ij 
Aq = 5 P;;(s = M?)Ci;(f — q) 


aj 


(9.22) 


with B,C and P as given above. In order to get precise limits (and to mea- 
sure Z’ properties once discovered as we will see later), the NNLO QCD 
corrections play an important role?® as do the leading order electroweak ra- 
diative corrections.?® Apart from the machine luminosity errors the largest 
uncertainty in the above cross section is due to the PDFs. For M <1 TeV 
or so these errors are of order ~ 5%°° but grow somewhat bigger for larger 


Z' Phenomenology and the LHC 551 


invariant masses: ~ 15(25)%3! for M = 3(5) TeV. As a point of comparison 
the corrected SM predictions for the W and Z production cross sections at 
the Tevatron are seen to agree with the data from both CDF and DO at the 
level a few percent.” 

It is somewhat more useful to perform some of the integrals above in 
order to make direct comparison with experimental data. To this end we 


define (for our LHC discussion below) 


do+ re fe do 
dM dy — y +f] dM dy dz’ 28) 


mes Y —Ymin do? 
U T + Í a, (9.24) 
dM i _y  |dM dy 


min 


and subsequently 


Here Y is cut representing the edge of the central detector acceptance(~ 1.1 
for the Tevatron detectors and ~ 2.5 for those at the LHC) with zo = 
min{tanh(Y —|y]|), 1] being the corresponding angular cut. Ymin is a possible 
cut employed to define the Z’ boost direction which we will return to below. 
As in the case of ete” collisions above, one can define an Arp(M) = 
do~ /do*. 

A Z’, being a weakly interacting beast, generally has a rather narrow 
width to mass ratio, i.e., [2,/MZ, << 1; e.g., in the case of the SM Z 
this ratio is ~ 1078. This being the case, almost the entire Z’ event rate 
comes from a rather narrow window of M values: M ~ Mz +21 7z,, or so. 
In this limit we can approximate the resonance as a 6-function in M and 
drop all of the SM contributions to the sums above. In this case, pieces of 
the P;; that go as, e.g., M4/|(M? — M2) + iMzTz,|? can be replaced by 
Z(M — My) Me, up to [?,/M¢, corrections, so that integrals over M can 
be performed analytically (since the integral over the PDFs is now just a 
constant factor). In such a limit, the contribution to the cross section for 
I*I— production from the Z’ is just oz B(Z’ — I*I~) with oz being the 
integrated value of the cross section at M = Mz), i.e., at the Z’ peak, and 
B being the leptonic branching fraction of the Z’. This is called the Narrow 
Width Approximation (NWA). In a similar way, Arg on the Z’ pole in the 
NWA is just the ratio do~ /do* evaluated at Mz; note that this ratio does 
not depend upon what decay modes (other than leptonic) that the Z’ might 
have. Also note that in the NWA, the continuum Drell-Yan background 
makes no contribution to the event rate. This is a drawback of the NWA 
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1/N dN/dz 


Fig. 9.3. Normalized leptonic angular distribution predicted from the decay of particles 
with different spin produced in gg annihilation. The dashed(solid,dotted) curves are for 
spin-0(2,1). The generated data corresponds to 1000 events in the spin-2 case. 


since it is sometimes important to know the height of the Z’ peak relative 
to this continuum to ascertain the Z’ signal significance. 

It is evident from the above cross section expressions that the Z’ (as 
well as y and Z) induced Drell-Yan cross section involves only terms with 
a particular angular dependence due to the spin-1 nature of the exchanged 
particles. In the NWA on the Z’ pole itself the leptonic angular distribution 
is seen to behave as ~ 1+27+8Aprpz/3, which is typical of a spin-1 particle. 
If the Z’ had not been a Z’ but, say, a in an R-parity violating SUSY 
model! which is spin-0, then the angular distribution on the peak would 
have been z-independent, i.e., flat(with, of course, Arg = 0). This is 
quite different than the ordinary Z’ case. If the Z’ had instead been an 
RS graviton? with spin-2, then the qq — I*I~ part of the cross section 
would behave as ~ 1 — 3z? + 424, while the gg — I"I part would go as ~ 
1—z*, both parts also yielding Apg = 0. These distributions are also quite 
distinctive. Figure 9.3 shows an example of these (normalized) distributions 
and demonstrates that with less than a few hundred events they are very 
easily distinguishable. Thus the Z’ spin should be well established without 
much of any ambiguity given sufficient luminosity. 

An important lesson from the NWA is that the signal rate for a Z’ 
depends upon B, the Z’ leptonic branching fraction. Usually in calculating 
B one assumes that the Z’ decays only to SM fields. Given the possible 
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Fig. 9.4. (a) The Drell-Yan distribution as seen by CDF. (b) CDF cross section lower 
bound in comparison to the predictions for the Z’ in the SSM. 


existence of SUSY as well as the additional fermions needed in extended 
electroweak models to cancel anomalies this assumption may be wrong. 
Clearly Z’ decays to these other states would decrease the value of B making 
the Z’ more difficult to observe experimentally. 

At the Tevatron only lower bounds on the mass of a Z’ exist. These 
bounds are obtained by determining the 95% CL upper bound on the pro- 
duction cross section for lepton pairs that can arise from new physics as 
a function of M(= Mz). (Note that this has a slight dependence on the 
assumption that we are looking for a Z’ due to the finite acceptance of the 
detector.) Then, for any given Z’ model one can calculate oz B(Z’ > It1~) 
as a function of Mz, and see at what value of Mz the two curves cross. 
At present the best limit comes from CDF although comparable limits are 
also obtained by D0.** The left panel in Fig. 9.4 shows the latest (summer 
2006) Drell-Yan spectrum from CDF; the right panel shows the correspond- 
ing cross section upper bound and the falling prediction for the Z’ cross 
section in the SSM. Here we see that the lower bound is found to be 850 
GeV assuming that only SM fermions participate in the Z’ decay. For other 
models an analogous set of theory curves can be drawn and the associated 
limits obtained. 

Figure 9.5 shows the resulting constraints (from a different CDF anal- 
ysis?” with a lower integrated luminosity but also employing the Arg ob- 
servable above the mass of the SM Z) on a number of the models discussed 
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Littlest Higgs Z’ cot 877=0.3]| cot 877=0.5|cot 0y =0.7|cot 0y =1.0 


E6Z' Model — | Zy | Zy | Zy | Zr | Zn | Zsec - 
Exp. Mz, limit (GeV/c?) 625 765 835 910 


Exp. limit (GeV/¢?)|735 710) 675 Obs. Mz, limit (GeV/c?)| 625 760 830 900 
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Fig. 9.5. Experimental lower bounds from CDF on a number of Z’ models: (a) Ee 
models (b) Little Higgs models. 


above all assuming Z’ decays to SM particles only and no Z-Z’ mixing. 
Looking at these results we see that the Tevatron bounds are generally su- 
perior to those from LEPII and are approaching the best that the other 
precision measurements can do. These bounds would degrade somewhat if 
we allowed the Z’ to have additional decay modes; for example, if B were 
reduced by a factor of 2 then the resulting search reach would be reduced 
by 50-100 GeV depending on the model. 

The Tevatron will, of course, be continuing to accumulate luminosity 
for several more years possibly reaching as high as 8 fb7! per experiment. 
Assuming no signal is found this will increase the Z’ search reach lower 
bound somewhat, ~ 20%, as is shown in Fig. 9.6 from?®. At this point the 
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Fig. 9.6. Extrapolation of the Z’ reach for a number of different models at the Tevatron 
as the integrated luminosity increases. Results from CDF and D0 are combined. 
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search reach at the Tevatron peters out due to the rapidly falling parton 
densities leaving the mass range above ~ 1 TeV for the LHC to explore. 


9.6. The LHC: Z’ Discovery and Identification 


The search for a Z’ at the LHC would proceed in the same manner as at the 
Tevatron. In fact, since the Z’ has such a clean (i.e., dilepton) signal and 
a sizable cross section it could be one of the first new physics signatures to 
be observed at the LHC even at relatively low integrated luminosities.?” °° 
Figure 9.7 shows both the theoretical anticipated 95% CL lower bound and 
the 5c discovery reach for several different Z’ models at the LHC for a single 
leptonic channel as the integrated luminosity is increased; these results are 
mirrored in detectors studies.*° Here we see that with only 10—20 pb~! the 
LHC detectors will clean up any of the low mass region left by the Tevatron 
below 1 TeV and may actually discover a 1 TeV Z’ with luminosities in the 
30—100 pb! range! In terms of discovery, however, to get out to the ~ 4—5 
TeV mass range will requite ~ 100 fo~! of luminosity. At such luminosities, 
the 95% CL bound exceeds the 5a discovery reach by about 700 GeV. In 
these plots, we have again assumed that the Z’ leptonic branching fraction 
is determined by decays only to SM fermions. Reducing B by a factor of 2 
could reduce these reaches by ~ 10% which is not a large effect. 

The Z’ peak at the LHC should be relatively easy to spot since the SM 
backgrounds are well understood as shown?®4! in Fig. 9.8 for a number of 
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Fig. 9.7. (a) 95% CL lower bound and (b) 5ø discovery reach for a Z’ as a function 
of the integrated luminosity at the LHC for w(red), x(green), ņn(blue), the LRM with 
k = 1(magenta), the SSM(cyan) and the ALRM(black). Decays to only SM fermions is 
assumed. 
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Fig. 9.8. Resonance shapes for a number of Z’ models as seen by ATLAS assuming 
Mz = 1.5 TeV. The continuum is the SM Drell-Yan background. 


different Z’ models. The one problem that may arise is for the case where 
the Z’ width, Tz, is far smaller than the experimental dilepton pair mass 
resolution, 6M. Typically in most models, Tz /Mz is of order ~ 0.01 which 
is comparable to dilepton pair mass resolution, 6M/M, for both ATLAS 
and CMS.” If, however, rz /Mz << M/M, then the Z’ resonance is 
smeared out due to the resolution and the cross section peak is reduced by 
roughly a factor of ~ ['z/6M making the state difficult to observe. This 
could happen, e.g., if the Z’ (before mixing with the SM Z) had no couplings 
to SM fields.“4 

Given the huge mass reach of the LHC it is important to entertain the 
question of how to ‘identify’ a particular Z’ model once such a particle is 
found. This goes beyond just being able to tell the Z’ of Model A from 
the Z’ from model B. As alluded to in the introduction, if a Z’-like object 
is discovered, the first step will be to determine its spin. Based on the 
theoretical discussion above this would seem to be rather straightforward 
and studies of this issue have been performed by both ATLAS and CMS.*° 
Generally, one finds that discriminating a spin-1 or spin-2 object from one 
of spin-0 requires several times more events than does discriminating spin-2 
from spin-1. The requirement of a few hundred events, however, somewhat 
limits the mass range over which such an analysis can be performed. If a 
particular Z’ model has an LHC search reach of 4 TeV, then only for masses 
below ~ 2.5 — 3 TeV will there be the statistics necessary to perform a 
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Fig. 9.9. (a) The theoretical predictions for 1.5 TeV SSM Z’ and RS graviton resonance 

shapes at ATLAS in comparison to the graviton signal data. (b) Differentiation, in ø, 


of spin-1 and spin-2 resonances at CMS as a function of the number of events assuming 
a 1.5 TeV mass. 


reliable spin determination. Figure 9.9 shows two sample results from this 
spin analysis. For the ATLAS study in the left panel*® the lepton angular 
distribution for a weakly coupled 1.5 TeV KK RS graviton is compared 
with the expectation for a SSM Z’ of identical mass assuming a luminosity 
of 100 fb-!. Here one clearly sees the obvious difference and the spin-2 
nature of the resonance. In the right panel*® the results of a CMS analysis 
is presented with the distinction of a 1.5 TeV Z’ and a KK graviton again 
being considered. Here one asks for the number of events (N) necessary 
to distinguish the two cases, at a fixed number of standard deviations, ø, 
which is seen to grow as (as it should ) with VN. For example, a 30 
separation is seen to require ~ 300 events. 

Once we know that we indeed have a spin-1 object, we next need 
to ‘identify’ it, i.e., uniquely determine its couplings to the various SM 
fermions. (Note that almost all LHC experimental analyses up to now have 
primarily focused on being able to distinguish models and not on actual 
coupling extractions.) We would like to be able to do this in as model- 
independent a way as possible, e.g., we should not assume that the Z’ 
decays only to SM fields. Clearly this task will require many more events 
than a simple discovery or even a spin determination and will probably 
be difficult for a Z’ with a mass much greater than ~ 2 — 2.5 TeV unless 
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Table 9.2. Results on oj; and oy x Iz, for all studied models from ATLAS. 
Here one compares the input values from the generator with the reconstructed 
values obtained after full detector simulation. 


ay, (fb) a, °° (fb) arte X Trec (fb.GeV) 

SSM 78.4+0.8 78.5+1.8 3550+137 

p 22.6+0.3 22.7+0.6 166415 

M = 1.5 TeV X 47.5+0.6 48.4+1.3 800+47 
n 26.2+0.3 24.6+0.6 212+16 

LR 50.8+0.6 51.141.3 1495+72 

M=A4Tev SSM  0.16+0.002 0.16-+0.004 19+1 

KK 2.2+0.07 2.2+0.12 331435 


integrated luminosities significantly in excess of 100 fb~! are achieved (as 
may occur at the LHC upgrade.*”) Some of the required information can 
be obtained using the dilepton (i.e., ete~ and/or yt u~) discovery channel 
but to obtain more information the examination of additional channels will 
also be necessary. 

In the dilepton mode, three obvious observables present themselves: 
(i) the cross section, ou, on and below the Z’ peak (it is generally very 
small above the peak), (ii) the corresponding values of Arg and (iii) the 
width, Iz, of the Z’ from resonance peak shape measurements. Recall that 
while Arg is B insensitive, both oy. and ['z are individually sensitive to 
what we assume about the leptonic branching fraction, B, so that they 
cannot be used independently. In the NWA, however, one sees that the 
product of the peak cross section and the Z’ width, oul z, is independent 
of B. (Due to smearing and finite width effects, one really needs to take the 
product of dot /dM, integrated around the peak and Tz.) Table 9.2 from 
an ATLAS study*® demonstrates that the product oul z, can be reliably 
determined at the LHC in full simulation, reproducing well the original 
input generator value. 

Let us now consider the quantity Arg. At the theory level, the angle 
0* employed above is defined to be that between the incoming q and the 
outgoing /~. Experimentally, though the lepton can be charge signed with 
relative ease, it is not immediately obvious in which direction the initial 
quark is going, i.e., to determine which proton it came from. However, since 
the q valence distributions are ‘harder’ (i.e., have higher average momentum 
fractions) than the ‘softer’ g sea partons, it is likely? that the Z’ boost 
direction will be that of the original g. Of course, this is not always true 
so that making this assumption dilutes the true value of Arg as does, e.g., 
additional gluon radiation. For the Z’ to be boosted, the leptons in the 
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Fig. 9.10. (a) Arg near a 1.5 TeV Z’ in a number of models. (b) On-peak differentiation 
of Eg models using Arp showing statistical errors for a 1.5 TeV Z’. 


final state need to have (significant) rapidity, hence the lower bound in the 
integration of the cross section expression above. Clearly, a full analysis 
needs to take these and other experimental issues into account. 

The left panel of Fig. 9.10 shows? Arg as a function of M in the re- 
gion near a 1.5 TeV Z’ for Eg model ņ in comparison with the predictions of 
several other models. Here we see several features, the first being that the 
errors on App are rather large except on the Z’ pole itself due to relatively 
low statistics even with large integrated luminosities of 100 fb~1; this is 
particularly true above the resonance. Second, it is clear that Arg both on 
and off the peak does show some reasonable model sensitivity as was hoped. 
From the right panel”? of Fig. 9.10 it is clear that the various special case 
models of the Eg family are distinguishable. This is confirmed by more 
detailed studies performed by both ATLAS*® and CMS.*! Figure 9.11 from 
CMS?®! shows how measurements of the on-peak App can be used to dis- 
tinguish models with reasonable confidence given sufficient statistics (and 
in the absence of several systematic effects). Table 9.3 from the ATLAS 
study*® shows that the original input generator value of the on-peak Arg 
can be reasonably well reproduced with a full detector simulation, taking 
dilution and other effects into account. 

If a large enough on-peak data sample is available, examining Arg as a 
function of the lepton rapidity°? can provide additional coupling informa- 
tion. The reason for this is that u and d quarks have different x distributions 
so that the weight of uu and dd induced Z’ events changes as the rapidity 
varies. No detector level studies of this have yet been performed. 
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Fig. 9.11. CMS analysis of Z’ model differentiation employing Arg assuming Mz, = 1 
or 3 TeV. 


Table 9.3. Measured on-peak App for all studied models in the central mass 
bin from ATLAS. Here the raw value obtained before dilution corrections is 
labeled as ‘Observed’. 


Model | f L(fb~*) Generation Observed Corrected 
1.5 TeV 
SSM 100 +0.088 + 0.013 +0.060 + 0.022 | +0.108 + 0.027 
x 100 0.386 + 0.013 0.144 + 0.025 | —0.361 + 0.030 
n 100 0.112 + 0.019 0.067 + 0.032 | —0.204 + 0.039 
n 300 0.090 + 0.011 0.050 + 0.018 | —0.120 + 0.022 
w 100 +0.008 + 0.020 0.056 + 0.033 | —0.079 + 0.042 
Y 300 +0.010 + 0.011 0.019 + 0.019 | —0.011 + 0.024 
LR 100 +0.177 + 0.016 0.100 + 0.026 0.186 + 0.032 
4TeV 
SSM 10000 0.057 + 0.023 0.001 + 0.040 0.078 + 0.051 
KK 500 0.491 + 0.028 0.189 + 0.057 0.457 + 0.073 


Off-peak measurements of Arg are also useful although in this case 
systematics are more important; as shown in the ATLAS study,*® whose 
results are shown in Table 9.4, it is more difficult to reproduce the input 
generator value of this quantity than in the on-peak case. 

There are, of course, other observables that one may try to use in the 
dilepton channel but they are somewhat more subtle. The first possibil- 
ity”? is to reconstruct the Z’ rapidity distribution from the dilepton final 
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Table 9.4. Measured off peak, 0.8 < M < 1.4 TeV, Arg for all studied models 
from ATLAS using the same nomenclature as above. 


Model | f L(fb~*) Generation Observed Corrected 
1.5 TeV 
SSM 100 +0.077 + 0.025 +0.086 + 0.038 0.171 + 0.045 
x 100 +0.440 + 0.019 +0.180 + 0.032 | +0.354 + 0.039 
n 100 +0.593 + 0.016 +0.257 + 0.033 0.561 + 0.039 
w 100 +0.673 + 0.012 +0.294 + 0.033 | +0.568 + 0.039 
LR 100 +0.303 + 0.022 +0.189 + 0.033 0.327 + 0.040 
Shape of the different quark fractions Rapidity distribution 
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Fig. 9.12. (a) Rapidity distributions for different gg induced events. (b) Rapidity dis- 
tribution differentiation of Z’ models. 


state. The left panel of Fig. 9.12 reminds us that the Z’ rapidity distribu- 
tion produced by only wu, dd or sea quarks would have a different shape. 
The particular Z’ couplings to quarks induce different weights in these three 
distributions and so one may hope to distinguish models in this way. An 
example of this is shown in the right panel of Fig. 9.12. The first anal- 
ysis? of this type considered the quantity Rua, the fraction of Z’ events 
originating from uu, as an observable; a similar variable Rgg can also be 
constructed. Figure 9.13 from a preliminary ATLAS analysis”? compares 
the values of these two parameters extracted via full reconstruction for a 
1.5 TeV Z’; here we see that reasonable agreement with the input values of 
the generator are obtained although the statistical power is not very good. 
Knowing both Rj ,q and the ratio of the dd and uu parton densities fairly 
precisely, one can turn these measurements into a determination of the 
coupling ratio (v,2 + a,2)/(v2 + a2). 
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Fig. 9.13. Comparison of Rqq values determined at the generator level and after detector 
simulation by ATLAS. 


A second possibility is to construct the rapidity ratio in the region 
near the Z’ pole: 


yı d 
SEn dy W 


IEAS eal 


(9.25) 


Here yı is some suitable chosen rapidity value œ 1. R essentially measures 
the ratio of the cross section in the central region to that in the forward 
region and is again sensitive to the ratio of u and d quark couplings to the 
Z’. A detector level study of this observable has yet to be performed. 

In addition to the ete~ and pty discovery channel final states, one 
might also consider other possibilities, the simplest being rtr. Assuming 
universality, this channel does not provide anything new unless one can 
measure the polarization of the r’s, P,, on or very near the Z’ peak.” 
The statistics for making this measurement can be rather good as the rate 
for this process is only smaller than that of the discovery mode by the 7 
pair reconstruction efficiency. In the NWA, P, = 2val,/(u2 + a2), as- 
suming universality, so that the ratio of v/,/ai, can be determined uniquely. 
Figure 9.14 shows, for purposes of demonstration, the value of P, in the Ee 
model case where we see that it covers its fully allowed range. 

A first pass theoretical study®> suggests that ôP, ~ 1.5//N, with N 
here being the number of reconstructed 7 events. Even for a reconstruction 
efficiency of 3%, with Mz not too large ~ 1—1.5 TeV, the high luminosity 
of the LHC should be able to tell us P, at the +0.05 level. It would be 
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Fig. 9.14. 7 polarization asymmetry for a Z’ in Eg models in the NWA. 


very good to see a detector study for this observable in the near future to 
see how well the LHC can really do in this case. 

Once we go beyond the dileptons, the next possibility one can imagine 
is light quark jets from which one might hope to get a handle on the Z’ 
couplings to quarks. The possibility of new physics producing an observable 
dijet peak at the LHC has been studied in detail by CMS°°; the essential 
results are shown in Fig. 9.15. Here we see that for resonances which are 
color non-singlets, i.e., those which have QCD-like couplings, the rates are 
sufficiently large as to allow these resonances to be seen above the dijet 
background. However, for weakly produced particles, such as the SSM Z’ 
shown here, the backgrounds are far too large to allow observation of these 
decays. Thus it is very unlikely that the dijet channel will provide us with 
any information on Z’ couplings at the LHC. 

Another possibility is to consider the heavy flavor decay modes, i.e., 
Z’ — bb or tt. Unfortunately, these modes are difficult to observe so that it 
will be quite unlikely that we will obtain coupling information from them. 
ATLAS*” has performed a study of the possibility of observing these modes 
within the Little Higgs Model context for a Z’ in the 1-2 TeV mass range. 
Figure 9.16 from the ATLAS study demonstrates how difficult observing 
these decays may really be due to the very large SM backgrounds. It is thus 
unlikely that these modes will provide any important information except in 
very special cases. 
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for a number of models. (b) SSM Z’ dijet signal for various masses in comparison with 
the SM background. 
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Fig. 9.16. Search for heavy flavor decays of the Z’ in the Little Higgs model by ATLAS. 
cot 07; = 1 has been assumed. Z’ — bb assuming Mz, = 2 TeV and a luminosity of 300 
fot (a) and tt(b) for Mz, = 1 TeV and a luminosity of 30 fb™t. 


Another possible 2-body channel is Z’ — WtW~, which can occur at 
a reasonable rate through Z-Z’ mixing as discussed above. Clearly the rate 
for this mode is very highly model dependent. ATLAS®® has made a pre- 
liminary analysis of this mode in the jjlv final state taking the Z’ to be that 
of the SSM(for its fermionic couplings) and assuming a large integrated lu- 
minosity of 300 fb~!. The mixing parameter ( was taken to be unity in the 
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Fig. 9.17. Results of two ATLAS analyses showing the Z’ — WW signal above SM 
backgrounds and Z’ mass reconstruction in this channel for the SSM model assuming 
Mz = 1.5 TeV and 8 = 1. 


calculations. The authors of this analysis found that a Z’ in the mass range 
below ~ 2.2 TeV could be observed in this channel given these assumptions. 
An example is shown in Fig. 9.17 where we clearly see the reconstructed Z’ 
above the SM background. With a full detailed background study an esti- 
mate could likely be made of the relevant branching fraction in comparison 
to that of the discovery mode. This would give important information on 
the nature of the Z’ coupling structure. More study of this mode is needed. 

A parallel study was performed by ATLAS*! for the Z’ — ZH mode 
which also occurs through mixing as discussed above; this mixing occurs 
naturally in the Little Higgs model in the absence of T-parity. The results 
are shown in Fig. 9.18. Here we see that there is a respectable signal over 
background and the relevant coupling information should be obtainable 
provided the Z’ is not too heavy. 

Some rare decays of the Z’ may be useful in obtaining coupling infor- 
mation provided the Z’ is not too massive. Consider the ratios of Z’ partial 
widths°4:59-61 


PA > FFV) 


TFV = T(z = IHT) 5 (9.26) 
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Fig. 9.18. Search study for the decay Z’ — ZH by ATLAS in the Little Higgs model 
assuming cot 07; = 0.5 for the 1+1— bb mode assuming Mz, = 1 (a) or 2(b) TeV. 


where V =Z,W and ff’ = I*I-,l*v,vv, appropriately. The two [(Z’ — 
ffZ) (with f =1,v) partial widths originate from the bremsstrahlung of a 
SM Z off of either the f or f legs and are rather to imagine. Numerically, 
one finds that for the case f = l, little sensitivity to the Z’ couplings is 
obtained so it is not usually considered. Assuming that the SM v’s couple 
in a left-handed way to the Z’, it is clear that rz = Kzv2/(v2 + a2), 
where Kz is a constant, model-independent factor for any given Z’ mass. 
The signal for this decay is a (reconstructed) Z plus missing pr with a 
Jacobean peak at the Z’ mass. 

riyw, on the otherhand, is more interesting; not only can the W be 
produced as a brem but it can also arise directly if a WWZ’ coupling exists. 
As we saw above this can happen if Z-Z’ mixing occurs or it can happen if T” 
is proportional to Tsz. If there is no mixing and if T” has no Tzr, component 
then one finds the simple relation ruw = Kwv,2/(v2 + a2), with Kw 
another constant factor. Note that now rj,w and ryyz are proportional to 
one another and, since T’ and Tsz commute, one also has v! +a’, = v’, +a’, = 
2v!, so that both rww and ryyz are bounded, i.e., 0 < rww < Kw/2 and 
0 < rwz < Kz/2. Thus, e.g., in Eg models a short analysis shows that 
the allowed region in the ripw,rvvz plane will be a straight line beginning 
at the origin and ending at Kw /2,Kz/2. Other common models will lie 
on this line, such as the LRM and ALRM cases, but some others, e.g., the 
SSM, will lie elsewhere in this plane signaling the fact that T’ contains a 
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Fig. 9.19. Predictions for the rare decay mode ratios for a number of different models 
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line is the Ee case. 


Tz; component. Figure 9.19 from®! shows a plot of these parameters for a 
large number of models, the solid line being the just discussed Eg case and 
‘S’ the SSM result. 

While the coupling information provided by these ratios is very useful, 
the Z’ event rates necessary to extract them are quite high in most cases 
due to their small relative branching fractions. For a Z’ much more massive 
than 1-2 TeV the statistical power of these observables will be lost. 

A different way to get at the Z’ couplings is to produce it in association 
with another SM gauge boson, i.e., a photon® or a W*+,Z,°? with the Z’ 
decaying to dileptons as usual. Taking the ratio of this cross section to that 
in the discovery channel, we can define the ratios 


Ray = o(q¢ > Z'V)B(Z = I+) 
av olg > ZB 3 FT) 
in the NWA with V = y,W=, or Z. (For the case V = g there is little 
coupling sensitivity.?) Note that B trivially cancels in this ratio but it 
remains important for determining statistics. The appearance of an extra 
particle V in the final state re-weights the combination of couplings which 
appears in the cross section so that one can get a handle on the vector 


(9.27) 


and axial-vector couplings of the initial u’s and d’s to the Z’. For example, 
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Fig. 9.20. Ry in Eg models for a 1 TeV Z’ employing a cut Dr > 50 GeV. 


in the simple case of V = y, the associated parton level qq —Z’y cross 
section is proportional to $`; Q?(v,;2 +.a,2) while the simple Z’ cross section 
is proportional to 5>,(v,? + a,2). Similarly, for the case V =W, the cross 
section is found to be proportional to }>,(v; + a/)*. Tagging the additional 
V, when V Æ y, may require paying the price of leptonic branching fractions 
for the W and Z, which is a substantial rate penalty, although an analysis 
has not yet been performed. For the case of V = y, a hard pr cut on the 
will be required but otherwise the signature is very clean. All the ratios 
Rzy are of order a few x107? (or smaller once branching fractions are 
included) for a Z’ mass of 1 TeV and (with fixed cuts) tend to grow with 
increasing Mz. For example, for a 1 TeV Z’ in the Eg model, the cross 
section times leptonic branching fraction for the Z’y final state varies in the 
range 0.65-1.6 fb, depending upon the parameter 0, assuming a photon pr 
cut of 50 GeV. Ry for this case is shown in Fig. 9.20. Generically, with 100 
fb! of luminosity these ratios might be determined at the level of ~ 10% 
for the Mz, = 1 TeV case but the quality of the measurement will fall 
rapidly as Mz, increased due to quickly falling statistics. For much larger 
masses these ratios are no longer useful. It is possible that the Tevatron 
will tell us whether such light masses are already excluded. 

It is clear from the above discussion that there are many tools available 
at the LHC for Z’ identification. However, many of these are only applicable 
if the Z’ is relatively light. Even if all these observables are available it still 
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Fig. 9.21. Z’ search reach at a \/s=0.5 TeV(a) or 1 TeV(b) ILC as a function of the 
integrated luminosity without(solid) or with(dashed) 60% positron beam polarization 
for models (green), (red), SSM(magenta) and LRM with « = 1(blue). 


remains unclear as to whether or not the complete set of Z’ couplings can 
be extracted from the data with any reliability. A detailed analysis of this 
situation has yet to be performed. We will probably need a Z’ discovery 
before it is done. 


9.7. ILC: What Comes Next 


The ILC will begin running a decade or so after the turn on of the LHC. At 
that point perhaps as much as ~ 1 ab~! or more of integrated luminosity 
will have been delivered by the LHC to both detectors. From our point of 
view, the role of the ILC would then be to either extend the Z’ search reach 
(in an indirect manner) beyond that of the LHC or to help identify any Z’ 
discovered at the LHC.®4 

Although the ILC will run at ys = 0.5 — 1 TeV, we know from our dis- 
cussion of LEP Z’ searches that the ILC will be sensitive to Z’ with masses 
significantly larger than \/s. Figure 9.21°° shows the search reach for var- 
ious Z’ models assuming ys = 0.5,1 TeV as a function of the integrated 
luminosity both with and without positron beam polarization. Recall that 
the various final states ete~ — ff, f = e,,7,c¢,b,t can all be used si- 
multaneously to obtain high Z’ mass sensitivity. The essential observables 
employed here are do/dz and Aypr(z), which is now available since the e7 
beam is at least 80% polarized. One can also measure the polarization of 
7’s in the final state. This figure shows that the ILC will be sensitive to Z’ 
masses in the range (7 — 14),/s after a couple of years of design luminosity, 
the exact value depending on the particular Z’ model. Thus we see that 
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Fig. 9.22. A comparison of LHC direct and ILC indirect Z’ search reaches. 


it it relatively easy at the ILC to extend the Z’ reach beyond the 5-6 TeV 
value anticipated at the LHC. Figure 9.22 from® shows a comparison of the 
direct Z’ search reach at the LHC with the indirect reach at the ILC; note 
the very modest values assumed here for the ILC integrated luminosities. 
Here we see explicitly that the ILC has indirect Z’ sensitivity beyond the 
direct reach of the LHC. 

In the more optimistic situation where a Z’ is discovered at the LHC, the 
ILC will be essential for Z’ identification. As discussed above, it is unclear 
whether or not the LHC can fully determine the Z’ couplings, especially if 
it were much more massive than ~ 1 TeV. 

Once a Z’ is discovered at the LHC and its mass is determined, we 
can use the observed deviations in both do/dz and Ayr(z) at the ILC to 
determine the Z’ couplings channel by channel. For example, assuming 
lepton universality (which we will already know is applicable from LHC 
data), we can examine the processes ete” — I*/~ using Mz as an input 
and determine both v/ and a’, (up to a two-fold overall sign ambiguity); a 
measurement of 7 polarization can also contribute in this channel. With 
this knowledge, we can go on to the ete~ — bb channel and perform a 
simultaneous fit to v/ , and a’, ,; we could then go on to other channels such 
as cē and tt. In this way all of the Z’ couplings would be determined. An 
example of this is shown in Fig. 9.23 from®’ where we see the results of the 
Z' coupling determinations at the ILC in comparison with the predictions 
of a number of different models. 
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Fig. 9.23. The ability of the ILC to determine the Z’ leptonic couplings for a few 
representative models. 


9.8. Summary 


The LHC turns on at the end of next year and a reasonable integrated 
luminosity ~ 1 fb~+ will likely be accumulated in 2008 at ys = 14 TeV. 
The community-wide expectation is that new physics of some kind will 
be seen relatively ‘soon’ after this (once the detectors are sufficiently well 
understood and SM backgrounds are correctly ascertained). Many new 
physics scenarios predict the existence of a Z’ or Z’-like objects. It will 
then be up to the experimenters (with help from theorists!) to determine 
what these new states are and how they fit into a larger framework. In 
our discussion above, we have provided an overview of the tools which 
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experiments at the LHC can employ to begin to address this problem. To 
complete this program will most likely require input from the ILC. 

No matter what new physics is discovered at the LHC the times ahead 
should prove to be very exciting. 
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Chapter 10 


Neutrinoless Double Beta Decay 


Petr Vogel 
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Caltech, Pasadena, CA 91125, USA* 


10.1. Introduction - fundamentals of GG decay 


In the recent past neutrino oscillation experiments have convincingly shown 
that neutrinos have a finite mass. However, in oscillation experiments only 
the differences of squares of the neutrino masses, Am? = |m3 — m?|, can 
be measured, and the results do not depend on the charge conjugation 
properties of neutrinos, i.e., whether they are Dirac or Majorana fermions. 
Nevertheless, a lower limit on the absolute value of the neutrino mass scale, 
Mscale = \/|Am?|, has been established in this way. Its existence, in turn, 
is causing a renaissance of enthusiasm in the double beta decay community 
which is expected to reach and even exceed, in the next generation of exper- 
iments, the sensitivity corresponding to this mass scale. Below I review the 
current status of the double beta decay and the effort devoted to reach the 
required sensitivity, as well as various issues in theory (or phenomenology) 
concerning the relation of the OvGG decay rate to the absolute neutrino 
mass scale and to the general problem of the Lepton Number Violation 
(LNV). 

But before doing that I very briefly summarize the achievements of the 
neutrino oscillation searches and the role that the search for the neutrino- 
less double beta decay plays in the elucidation of the pattern of neutrino 
masses and mixing. In these introductory remarks I use the established 
terminology, some of which will be defined only later in the text. 

There is a consensus that the measurement of atmospheric neutrinos by 
the SuperKamiokande collaboration! can be only interpreted as a conse- 
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quence of the nearly maximum mixing between v, and v, neutrinos, with 
the corresponding mass squared difference |Am?,,,,| ~ 2.4 x 1073 eV. This 
finding was confirmed by the K2K experiment? that uses accelerator v, 
beam pointing towards the SuperKamiokande detector 250 km away, as 
well as by the very recent first result of the MINOS experiment located 
at the Sudan mine in Minnesota, 735 km away from the Fermilab. Sev- 
eral large long-baseline experiments are being built to further elucidate this 
discovery, and determine the corresponding parameters more accurately. 

At the same time the “solar neutrino puzzle”, which has been with us 
for over thirty years since the pioneering chlorine experiment of Davis,* also 
reached the stage where the interpretation of the measurements in terms of 
oscillations between the ve and some combination of the active, i.e., v, and 
v, neutrinos, is inescapable. In particular, the juxtaposition of the results 
of the SNO experiment” and SuperKamiokande,® together with the earlier 
solar neutrino flux determination in the gallium experiments,”* leads to 
that conclusion. The value of the corresponding oscillation parameters, 
however, remained uncertain, with several “solutions” possible, although 
the so-called Large Mixing Angle (LMA) solution with sin? 20so1 ~ 0.8 and 
Am?2,, ~ 1074 eV? was preferred. A decisive confirmation of the “solar” 
oscillations was provided by the nuclear reactor experiment KamLAND®!° 
that demonstrated that the flux of the reactor De is reduced and its spectrum 
distorted at the average distance ~ 180 km from nuclear reactors. 

The pattern of neutrino mixing is further simplified by the constraint 
due to the Chooz and Palo Verde reactor neutrino experiments!!:!? which 
lead to the conclusion that the third mixing angle, 013, is small, sin? 20,3 < 
0.1. The two remaining possible neutrino mass patterns are illustrated in 
Fig. 10.1. 

Altogether, clearly a lower limit for at least one of the neutrino masses, 
VAm?,, œ 0.05 eV has been established. However, the oscillation ex- 
periments cannot determine the absolute magnitude of the masses and, in 
particular, cannot at this stage separate two rather different scenarios, the 
hierarchical pattern of neutrino masses in which m ~ v Am? and the de- 
generate pattern in which m > VAm?. It is hoped that the search for 
the neutrinoless double beta decay, reviewed here, will help in foreseeable 
future in determining, or at least narrowing down, the absolute neutrino 
mass scale, and in deciding which of these two possibilities is applicable. 

Moreover, the oscillation results do not tell us anything about the prop- 
erties of neutrinos under charge conjugation. While the charged leptons 
are Dirac particles, distinct from their antiparticles, neutrinos may be the 
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Fig. 10.1. Schematic illustration (mass intervals not to scale) of the decomposition of 
the neutrino mass eigenstates v; in terms of the flavor eigenstates. The two hierarchies 
cannot be, at this time, distinguished. The small admixture of ve into v3 is an upper 
limit. 


ultimate neutral particles, as envisioned by Majorana, that are identical to 
their antiparticles. That fundamental distinction becomes important only 
for massive particles and becomes irrelevant in the massless limit. Neutrino- 
less double beta decay proceeds only when neutrinos are massive Majorana 
particles, hence its observation would resolve the question. 

Double beta decay (3) is a nuclear transition (Z, A) > (Z + 2, A) in 
which two neutrons bound in a nucleus are simultaneously transformed into 
two protons plus two electrons (and possibly other light neutral particles). 
This transition is possible and potentially observable because nuclei with 
even Z and N are more bound than the odd-odd nuclei with the same 
A= N+ Z. Analogous transition of two protons into two neutrons are 
also, in principle, possible in several nuclei, but phase space considerations 
give preference to the former mode. 

An example is shown in Fig. 10.2. The situation shown there is not 
exceptional. There are eleven analogous cases (candidate nuclei) with the 
Q-value (i.e., the energy available to leptons) in excess of 2 MeV. 

There are two basic modes of the GG decay. In the two-neutrino mode 
(2v6) there are two De emitted together with the two e~. Lepton number 
is conserved and this mode is allowed in the standard model of electroweak 
interaction. It has been repeatedly observed by now in a number of cases 
and proceeds with a typical half-life of ~ 10?°years. In contrast, in the 
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L 1 
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Fig. 10.2. Atomic masses of the isotopes with A = 136. Nuclei 136Xe, 136 Ba and 136 Ce 
are stable against the ordinary 8 decay; hence they exist in nature. However, energy 
conservation alone allows the transition 136Xe > 136 Ba + 2e7 (+ possibly other neutral 
light particles) and the analogous decay of 1°°Ce with the positron emission. 


neutrinoless mode (0v) only the 2e~ are emitted and nothing else. That 
mode clearly violates the law of lepton number conservation and is forbid- 
ded in the standard model. Hence, its observation would be a signal of a 
“new physics”. 

The two modes of the GG decay have some common and some distinct 
features. The common features are: 


e The leptons carry essentially all available energy. The nuclear recoil 
is negligible, Q/Am, < 1. 

e The transition involves the 0+ ground state of the initial nucleus 
and (in almost all cases) the 0* ground state of the final nucleus. In 
few cases the transition to an excited 0* state in the final nucleus 
is energetically possible, but suppressed by the smaller phase space 
available. (But the 2v@@ decay to the excited 07 state has been 
observed in few cases.) 

e Both processes are of second order of weak interactions, ~ G$, 
hence inherently slow. The phase space consideration alone (for the 
2v38 mode ~ Q!! and for the 0vV83 mode ~ Q?) give preference 
to the 0vG@ which is, however, forbidden by the lepton number 
conservation. 


The distinct features are: 


e In the 2vG6 mode the two neutrons undergoing the transition are 
uncorrelated (but decay simultaneously) while in the 0v3G the two 
neutrons are correlated. 
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e Inthe 2v86 mode the sum electron kinetic energy Tı +72 spectrum 
is continuous and peaked below Q/2. As Tı +72 — Q the spectrum 
approaches zero approximately like (AE/Q)°. 

e On the other hand, in the 0vGG mode Tı + Tz = Q smeared only 
by the detector resolution. 


These last features allow one to separate the two modes experimentally 
by observing the sum electron spectrum with a good energy resolution, even 
if the corresponding decay rate for the 0vGG mode is much smaller than 
for the 2v8G mode. This is illustrated in Fig. 10.3 where the insert that 
includes the Ov peak and the 2v tail shows the situation for the rate ratio 
of 1: 10° corresponding to the most sensitive current experiments. 

Various aspects, both theoretical and experimental, of the GG decay 
have been reviewed many times. Here I quote just the more recent review 
articles, 13716 

In this introductory section let me make only few general remarks. The 
existence of the 0vG@ decay would mean that on the elementary particle 
level a six fermion lepton number violating amplitude transforming two u 


earlier references can be found there. 


quarks into two d quarks and two electrons is nonvanishing. As was first 
pointed out by Schechter and Valle!” more than twenty years ago, this fact 
alone would guarantee that neutrinos are massive Majorana fermions (see 
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Fig. 10.3. Separating the 0vGG mode from the 2vGG by the shape of the sum elec- 
tron spectrum (kinetic energy Ke of the two electrons), including the effect of the 2% 
resolution smearing. The assumed 2v/0v rate ratio is 10, and 10° in the insert. 
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Fig. 10.4. By adding loops involving only standard weak interaction processes the 0v 88 
decay amplitude (the black box) implies the existence of the Majorana neutrino mass. 


Fig. 10.4). This qualitative statement (or theorem), unfortunately, does 
not allow us to deduce the magnitude of the neutrino mass once the rate 
of the 0vGG decay have been determined. 

There is no indication at the present time that neutrinos have nonstan- 
dard interactions, i.e. they seem to have only interactions carried by the 
W and Z bosons that are contained in the Standard Electroweak Model. 
All observed oscillation phenomena can be understood if one assumes that 
that neutrinos interact exactly the way the Standard Model prescribes, but 
are massive fermions forcing a generalization of the model. If we accept 
this, but in addition assume that neutrinos are Majorana particles, we can 
in fact relate the OvGG decay rate to the quantity related to the absolute 
neutrino mass. With these caveats that relation can be expressed as 


1 


Ti 


= G” (Q, Z)|M°"|? (map)? , (10.1) 


where G% (Q, Z) is a phase space factor that depends on the transition 
Q value and through the Coulomb effect on the emitted electrons on the 
nuclear charge Z and that can be easily and accurately calculated, M°” 
is the nuclear matrix element that can be evaluated in principle, although 
with a considerable uncertainty, and finally the quantity (mgg) is the effec- 
tive neutrino Majorana mass, representing the important particle physics 
ingredient of the process. 

In turn, the effective mass (mgg) is related to the mixing angles 0;j 
that are determined or constrained by the oscillation experiments, to the 
absolute neutrino masses m; of the mass eigenstates v; and to the as of 
now totally unknown additional parameters as fundamental as the mixing 
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angles 0;j, the so-called Majorana phase a(i), 
(mgp) = [2il Uei PeO mal . (10.2) 


Here Ue; are the matrix elements of the first row of the neutrino mixing 
matrix. 

It is straightforward to use the eq. (10.2) and the known neutrino os- 
cillation results in order to relate (mgg) to other neutrino mass dependent 
quantities. This is illustrated in Fig. 10.5. Traditionally such plot is made 
as in the left panel. However, the lightest neutrino mass Mmin is not an 
observable quantity. For that reason the other two panels show the relation 
of (mgg) to the sum of the neutrino masses M = Xm, and also to (mg) 
that represents the parameter that can be determined or constrained in 
ordinary ( decay, 


(mg)? = £i Uai |’ m} . (10.3) 


Several remarks are in order. First, the observation of the 0vGG decay 
and determination of (mgg), even when combined with the knowledge of 
M and/or (mg) does not allow, in general, to distinguish between the 
normal and inverted mass orderings. This is a consequence of the fact that 
the Majorana phases are unknown. In regions in Fig. 10.5 where the two 
hatched bands overlap it is clear that two solutions with the same (mgg) 
and the same M (or the same (mg)) exist and cannot be distinguished. 

On the other hand, obviously, if one can determine that (mgg) > 0.1 
eV we would conclude that the mass pattern is degenerate. And in the so 
far hypothetical case that one could show that (mgg) < 0.01 - 0.02 eV but 
nonvanishing nevertheless the normal hierarchy would be established.* 

It is worthwhile noting that if the inverted mass ordering is realized 
in nature, (and neutrinos are Majorana particles) the quantity (mgg) is 
constrained from below by ~ 0.01 eV. This value is within reach of the 
next generation of experiments. Also, in principle, in the case of the normal 
hierarchy while all neutrinos could be massive Majorana particles it is still 
possible that (mgg) = 0. Such a situation, however, requires “fine tuning” 
or reflects a symmetry of some kind. 

Let us remark that the 0vGG@ decay is not the only LNV process for 
which important experimental constraints exist. Examples of the other 
“In that case also the (mg) in the right panel would not represent the quatity directly 


related to the ordinary 8 decay. There are no realistic ideas, however, how to reach the 
corresponding sensitivity in ordinary 8 decay at this time. 
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Fig. 10.5. The left panel shows the dependence of (mgg) on the mass of the lightest 
neutrino Mmin, the middle one shows the relation between (mgg) and the sum of neutrino 
masses M = Xm; determined or constrained by the “observational cosmology”, and the 
right one depicts the relation between (mgg) and the effective mass (mg) determined 
or constrained by the ordinary 8 decay. In all panels the width of the hatched area is 
due to the unknown Majorana phases and therefore irreducible. The solid lines indicate 
the allowed regions by taking into account the current uncertainties in the oscillation 
parameters; they will shrink as the accuracy improves. The two sets of curves correspond 
to the normal and inverted hierarchies, they merge above about (mgg) > 0.1 eV, where 
the degenerate mass pattern begins. 


analogous processes are 
uT +(Z,A) > e+ + (Z — 2, A); exp. branching ratio < 107’? , 
Kt — ptutn; exp. branching ratio < 3 x 107° , 


De emission from the Sun; exp. branching ratio < 1074 . (10.4) 


However, detailed analysis suggests that the study of the 0v@G decay is 
by far the most sensitive test of LNV. In simple terms, this is caused by 
the amount of tries one can make. A 100 kg Ov decay source contains 
~ 10?” nuclei. This can be contrasted with the possibilities of first pro- 
ducing muons or kaons, and then searching for the unusual decay channels. 
The Fermilab accelerators, for example, produce “a few” x 102° protons on 
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target per year in their beams and thus correspondingly smaller numbers 
of muons or kaons. 


10.2. Mechanism of the 0vGG decay 


It has been recognized long time ago that the relation between the 0vGG 
decay rate and the effective Majorana mass (mgg) is to some extent prob- 
lematic. The assumption leading to the eq. (10.1) is rather conservative, 
namely that there is an exchange of a virtual light, but massive, Majorana 
neutrino between the two nucleons undergoing the transition, and that 
these neutrinos interact by the standard left-handed weak currents. How- 
ever, that is not the only possible mechanism. LNV interactions involving 
so far unobserved heavy (~ TeV) particles can lead to a comparable 0v33 
decay rate. Thus, in the absence of additional information about the mech- 
anism responsible for the 0v8 decay, one could not unambiguously infer 
(mga) from the 0v8 decay rate. 

In general 0vGG decay can be generated by (i) light massive Majorana 
neutrino exchange or (ii) heavy particle exchange (see, e.g. Refs. 18,19), 
resulting from LNV dynamics at some scale A above the electroweak one. 
The relative size of heavy (Az) versus light particle (Az) exchange contri- 
butions to the decay amplitude can be crudely estimated as follows:?° 

m My A Mọ (k? 

Ar ~ GÈ a An~ Gee, T ~ os (10.5) 
where (mgg) is the effective neutrino Majorana mass, (k?) ~ (50 MeV)? 
is the typical light neutrino virtuality, and A is the heavy scale relevant to 
the LNV dynamics. Therefore, Ag /Azr ~ O(1) for (mgg) ~ 0.1 — 0.5 eV 
and A ~ 1 TeV, and thus the LNV dynamics at the TeV scale leads to 
similar 0vG decay rate as the exchange of light Majorana neutrinos with 
the effective mass (mgg) ~ 0.1 — 0.5 eV. 

Obviously, the lifetime measurement by itself does not provide the 
means for determining the underlying mechanism. The spin-flip and non- 


flip exchange can be, in principle, distinguished by the measurement of the 
single-electron spectra or polarization (see e.g.,7!). However, in most cases 
the mechanism of light Majorana neutrino exchange, and of heavy particle 
exchange cannot be separated by the observation of the emitted electrons. 
Thus one must look for other phenomenological consequences of the differ- 
ent mechanisms other than observables directly associated with 0vGG. Here 
I discuss the suggestion?” that under natural assumptions the presence of 
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low scale LNV interactions also affects muon lepton flavor violating (LFV) 
processes, and in particular enhances the  — e conversion compared to 
the u — ey decay. 

The discussion is concerned mainly with the branching ratios Bye, = 
T(u > ey) /TO) and Bue = Tconv/Tcapt, where u — ey is normalized 
to the standard muon decay rate rT® = (Gim?)/(1927°), while y — e 
conversion is normalized to the corresponding capture rate Pcapt. The main 
diagnostic tool in the analysis is the ratio 


R = Bye/Byusey » (10.6) 


and the relevance of our observation relies on the potential for LFV discov- 
ery in the forthcoming experiments MEG?’ (u — ey) and MECO”4 (u > e 
conversion)” that plan to improve the current limits by several orders of 
magnitude. 

It is useful to formulate the problem in terms of effective low energy 
interactions obtained after integrating out the heavy degrees of freedom 
that induce LNV and LFV dynamics. If the scales for both LNV and LFV 
are well above the weak scale, then one would not expect to observe any 
signal in the forthcoming LFV experiments, nor would the effects of heavy 
particle exchange enter 0v8 at an appreciable level. In this case, the only 
origin of a signal in 0v ZG at the level of prospective experimental sensitivity 
would be the exchange of a light Majorana neutrino, leading to eq. (10.1), 
and allowing one to extract (mgg) from the decay rate. 

In general, however, the two scales may be distinct, as in SUSY-GUT?° 
or SUSY see-saw?° models. In these scenarios, both the Majorana neutrino 
mass as well as LFV effects are generated at the GUT scale. The effects of 
heavy Majorana neutrino exchange in 0v GG are, thus, highly suppressed. In 
contrast, the effects of GUT-scale LFV are transmitted to the TeV-scale by 
a soft SUSY-breaking sector without mass suppression via renormalization 
group running of the high-scale LFV couplings. Consequently, such scenar- 
ios could lead to observable effects in the upcoming LFV experiments but 
with an O(a) suppression of the branching ratio B „>e relative to By ey 
due to the exchange of a virtual photon in the conversion process rather 
than the emission of a real one, thus R ~ 107-3) in this case. 

The case where the scales of LNV and LFV are both relatively low 
(~ TeV) is more subtle. This is the scenario which might lead to observ- 
able signals in LFV searches and at the same time generate ambiguities in 


bEven though MECO experiment was recently cancelled, proposals for experiments with 
similar sensitivity exist elsewhere. 
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interpreting a positive signal in 0vGG. Therefore, this is the case where one 
needs to develop some discriminating criteria. 

Denoting the new physics scale by A, one has a LNV effective lagrangian 
of the form 


Ci ~ $ = - c 
Loves = mG Oi Oi = iq Meqelse™ , (10.7) 


where we have suppressed the flavor and Dirac structures (a complete list 
of the dimension nine operators O; can be found in Ref. 19). 
For the LFV interactions, one has 


Ci 


Lirv = a O; , (10.8) 


and a complete operator basis can be found in Refs. 27,28. The LFV 
operators relevant to our analysis are of the following type (along with 
their analogues with L + R): 


Oz = Cit Op iD C51 FY + he. 


-E 
(47)? 
Orr = lir Gr Gr lmt (10.9) 
Org = Greb; Tag - 


Operators of the type O, are typically generated at one-loop level, hence 
our choice to explicitly display the loop factor 1/(47)?. On the other hand, 
in a large class of models, operators of the type Og or Og, are generated by 
tree level exchange of heavy degrees of freedom. With the above choices, 
all non-zero c; and č; are nominally of the same size, typically the prod- 
uct of two Yukawa-like couplings or gauge couplings (times flavor mixing 
matrices). 

With the notation established above, the ratio R of the branching ratios 
u > eto u > e+7 can be written schematically as follows (neglecting flavor 
indices in the effective couplings and the term with L e R): 


d : a A? 
R= 73,2 Ay Cot + e" (Azcen + A3ceq) log m? 
+ Aa (47) ?ceq + TA Fiedler Her] (10.10) 


In the above formula 1,2,3,4 are numerical factors of O(1), while the overall 
factor we arises from phase space and overlap integrals of electron and 
muon wavefunctions in the nuclear field. For light nuclei ® = (ZF°)/(gj, + 


39%) ~ O(1) (gv, are the vector and axial nucleon form factors at zero 
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momentum transfer, while F, is the nuclear form factor at q? = —m7?*). 
The dots indicate subleading terms, not relevant for our discussion, such as 
loop-induced contributions to cg and ceq that are analytic in external masses 
and momenta. In contrast the logarithmically-enhanced loop contribution 
given by the second term in the numerator of R plays an essential role. This 
term arises whenever the operators Ogz,r and/or Ogg appear at tree-level 
in the effective theory and generate one-loop renormalization of Ogq?" (see 
Fig. 10.6). 


Or, Org 


ee eas 


Y 
q : _ 4 


Fig. 10.6. Loop contributions to u — e conversion through insertion of operators Og or 
Ogg, generating the large logarithm. 


The ingredients in eq. (10.10) lead to several observations: (i) In absence 
of tree-level cez and ceq, one obtains R ~ (® AZ a)/(127) ~ 107% — 107%, 
due to gauge coupling and phase space suppression. (ii) When present, the 
logarithmically enhanced contributions compensate for the gauge coupling 
and phase space suppression, leading to R ~ O(1). (iii) If present, the 
tree-level coupling ce, dominates the u — e rate leading to R > 1. 

Thus, we can formulate our main conclusions regarding the discriminat- 
ing power of the ratio R: 


(1) Observation of both the LFV muon processes u — e and u —> ey 
with relative ratio R ~ 107? implies, under generic conditions, that 
Torgg ~ (mgg). Hence the relation of the 0v3@ lifetime to the absolute 
neutrino mass scale is straightforward. 

(2) On the other hand, observation of LFV muon processes with relative 
ratio R >> 107? could signal non-trivial LNV dynamics at the TeV 
scale, whose effect on 0vGZ has to be analyzed on a case by case basis. 
Therefore, in this scenario no definite conclusion can be drawn based 
on LFV rates. 
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(3) Non-observation of LFV in muon processes in forthcoming experiments 
would imply either that the scale of non-trivial LFV and LNV is above 
a few TeV, and thus l'o,gg ~ (mgg)?, or that any TeV-scale LNV is 
approximately flavor diagonal. 


The above statements are illustrated using two explicit cases:?? the 
minimal supersymmetric standard model (MSSM) with R-parity violation 
(RPV-SUSY) and the Left-Right Symmetric Model (LRSM). 

RPV SUSY — If one does not impose R-parity conservation |R = 
(—1)3(8-1)+s) the MSSM superpotential includes, in addition to the stan- 
dard Yukawa terms, lepton and baryon number violating interactions, com- 
pactly written as (see e.g.,?%) 


Wrev = Aijk LiL; Ey, + ijk LiQj z+ ArU; D3 D} 
+ u;LiHu F (10.11) 


where L and Q represent lepton and quark doublet superfields, while E°, 
U*, D° are lepton and quark singlet superfields. The simultaneous pres- 
ence of \’ and A” couplings would lead to an unacceptably large proton 
decay rate (for SUSY mass scale Asysy ~ TeV), so we focus on the case 
of A” = 0 and set u’ = 0 without loss of generality. In such case, lep- 
ton number is violated by the remaining terms in Wrpy, leading to short 
distance contributions to 0vGG [e.g., Fig. 10.7(a)], with typical coefficients 
[cf. eq. (10.7)] 
i 12 12 
en XF: 
where as, œ2 represent the strong and weak gauge coupling constants, re- 


spectively. The RPV interactions also lead to lepton number conserving 
but lepton flavor violating operators [e.g. Fig. 10.7(b)], with coefficients 
[cf. eq. (10.8)] 


Ce Mii Ain AfA 
m~ Hli, Ainai 


2 Z `’ 2 ) 

A M5, m$, 
E $ Ix / 

Clq ATA: ATi A21 10.13 

O 2 ma (10.13) 
me, Ma, 

Co AA* NA 

OE ay nt ee 

2 2°? 72 ? 
A m må 


where the flavor combinations contributing to Co can be found in Ref. 30. 
Hence, for generic flavor structure of the couplings À and A’ the underlying 
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LNV dynamics generate both short distance contributions to 0vGG and 
LFV contributions that lead to R > 107?. 

Existing limits on rare processes strongly constrain combinations of 
RPV couplings, assuming Asysy is between a few hundred GeV and 
~1 TeV. Non-observation of LFV at future experiments MEG and MECO 
could be attributed either to a larger Asygy (> few TeV) or to suppression 
of couplings that involve mixing among first and second generations. In the 
former scenario, the short distance contribution to 0v8 does not compete 
with the long distance one [see eq. (10.5)], so that Forga ~ (mgg)?. On the 
other hand, there is an exception to this ” diagnostic tool”. If the A and ’ 
matrices are nearly flavor diagonal, the exchange of superpartners may still 
make non-negligible contributions to 0vG@ without enhancing the ratio R. 


d An e 

à i H A211 d 

u i 
g ‘a 
i U 

Ù d Mit e 

q- d i „e 
Ain 


(a (b) 


Fig. 10.7. Gluino exchange contribution to 0v8 (a), and typical tree-level contribution 
to Ogg (b) in RPV SUSY. 


LRSM — The LRSM provides a natural scenario for introducing non- 
sterile, right-handed neutrinos and Majorana masses.°! The corresponding 
electroweak gauge group SU(2)z x SU(2)r x U(1)B_1, breaks down to 
SU(2), x U(1)y at the scale A > O(TeV). The symmetry breaking is 
implemented through an extended Higgs sector, containing a bi-doublet ® 
and two triplets Az, r, whose leptonic couplings generate both Majorana 
neutrino masses and LFV involving charged leptons: 


Ly” = -Tr' (yf o+ 98 6) Ih 


— (Ex)2* y, Ar L} — (LR) t yi, Ar L}. (10.14) 
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Fig. 10.8. Typical doubly charged Higgs contribution to 0vGG (a) and to Op (b) in the 
LRSM. 


Here 6 = 02P* 09, Arr = io2Az,R, and leptons belong to two isospin 
doublets Li, pR = (v4 rl} r). The gauge symmetry is broken through the 
VEVs (AQ) = vr, (A?) = 0, (®) = diag(1, K2). After diagonalization of 
the lepton mass matrices, LFV arises from both non-diagonal gauge inter- 
actions and the Higgs Yukawa couplings. In particular, the Az, p-lepton 
interactions are not suppressed by lepton masses and have the structure 


L~ AERE hij (1 + 5)é; + h.c.. The couplings hij are n general non- 


diagonal and related to the heavy neutrino mixing matrix. 
Short distance contributions to 0v 88 arise from the exchange of both 
heavy vs and Az pr (Fig. 10.8a), with 


Či 93 1 ; 93 hee 
—~ — ; MS M2 i (10.15) 
where gz is the weak gauge coupling. LFV operators are also generated 
through non-diagonal gauge and Higgs vertices, with? (Fig. 10.8b) 

c huihže Co hih)e f 

-Š FA ar i=e,h,T. (10.16) 
Note that the Yukawa interactions needed for the Majorana neutrino mass 
necessarily imply the presence of LNV and LFV couplings hij and the cor- 
responding LFV operator coefficients cg, leading to R ~ O(1). Again, 
non-observation of LFV in the next generation of experiments would typ- 
ically push A into the multi-TeV range, thus implying a negligible short 
distance contribution to 0v 88. As with RPV-SUSY, this conclusion can be 
evaded by assuming a specific flavor structure, namely ym approximately 


diagonal or a nearly degenerate heavy neutrino spectrum. 
In both of these phenomenologically viable models that incorporate 
LNV and LFV at low scale (~ TeV), one finds R >> 1072.273032 Tt is likely 
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that the basic mechanism at work in these illustrative cases is generic: low 
scale LNV interactions (AL = +1 and/or AL = +2), which in general con- 
tribute to 0v6, also generate sizable contributions to u — e conversion, 
thus enhancing this process over  — ey. 

In conclusion, the above considerations suggest that the ratio R = 
By—e/By—ey of muon LFV processes will provide important insight about 
the mechanism of neutrinoless double beta decay and the use of this process 
to determine the absolute scale of neutrino mass. Assuming observation of 
LFV processes in forthcoming experiments, if R ~ 1072? the mechanism of 
Ov GB is light Majorana neutrino exchange; if R >> 1077, there might be 
TeV scale LNV dynamics, and no definite conclusion on the mechanism of 
Ov G6 can be drawn based only on LFV processes. 


10.3. Overview of the experimental status of search for G8 
decay 


The field has a venerable history. The rate of the 2vGG decay was first 
estimated by Maria Goeppert-Mayer already in 1937 in her thesis work 
suggested by E. Wigner, basically correctly. Yet, first experimental obser- 
vation in a laboratory experiment was achieved only in 1987, fifty years 
later. Why it took so long? As pointed out above, the typical half-life of 
the 2vBG decay is ~ 10% years. Yet, its “signature” is very similar to nat- 
ural radioactivity, present to some extent everywhere, and governed by the 
half-life of ~ 10!° years. So, background suppression is the main problem 
to overcome when one wants to study either of the 68 decay modes. 

During the last two decades the 2v 88 decay has been observed in “live” 
laboratory experiments in many nuclei, often by different groups and using 
different methods. That shows not only the ingenuity of the experimen- 
talists who were able to overcome the background nemesis, but makes it 
possible at the same time to extract the corresponding 2v nuclear matrix 
element from the measured decay rate. In the 2v mode the half-life is given 
by 


1/Tyj2 = G*(Q,Z)|M”? , (10.17) 


where G” (Q, Z) is an easily and accurately calculable phase space factor. 

The resulting nuclear matrix elements M?”, which have the dimension 
energy !, are plotted in Fig. 10.9. Note the pronounced shell dependence; 
the matrix element for !°°Mo is almost ten times larger than the one for 


130Te, Evaluation of these matrix elements, to be discussed below, is an 
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Fig. 10.9. Nuclear matrix elements for the 2vGG decay extracted from the measured 
half-lives. 


important test for the nuclear theory models that aim at the determination 
of the analogous but different quantities for the Ov neutrinoless mode. 

The challenge of detecting the 0v@G decay is, at first blush, easier. 
Unlike the continuous 2vG@ decay spectrum with a broad maximum at 
rather low energy where the background suppression is harder, the 0vGG 
decay spectrum is sharply peaked at the known Q value (see Fig. 10.3), 
at energies that are not immune to the background, but a bit less difficult 
to manage. However, as also indicated in Fig. 10.3, to obtain interesting 
results at the present time means to reach sensitivity to the Ov half-lives 
that are ~ 10° times longer than the 2v decay half-life of the same nucleus. 
So the requirements of background suppression are correspondingly even 
more severe. 

The historical lessons are illustrated in Fig. 10.10 where the past limits 
on the 0v8 decay half-lives of various candidate nuclei are translated using 
the eq. (10.1) into the limits on the effective mass (mgg). When plotted 
in the semi-log plot this figure represents the “Moore’s law” of double beta 
decay, and indicates that, provided that the past trend will continue, the 
mass scale corresponding to Am2,,, will be reached in about 10 years. 
This is also the time scale of significant experiments these days. Indeed, 


as discussed further, preparations are on the way to reach this sensitivity 
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Fig. 10.10. The limit of the effective mass (mgg) extracted from the experimental lower 
limits on the 0vGG decay half-life versus the corresponding year. The gray band near 


bottom indicates the Am? value. Figure originally made by S. Elliott. 


atm 


goal. Note that the figure was made using some assumed values of the 
corresponding nuclear matrix elements, without including their uncertainty. 
For such illustrative purposes they are, naturally, irrelevant. 

The past search for the neutrinoless double beta decay, illustrated in 
Fig. 10.10, was driven by the then current technology and the resources of 
the individual experiments. The goal has been simply to reach sensitivity 
to longer and longer half-lives. The situation is different, however, now. 
The experimentalists at the present time can and do use the knowledge 
summarized in Fig. 10.5 to gauge the aim of their proposals. Based on that 
figure, the range of the mass parameter (mgg) can be divided into three 
regions of interest. 
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e The degenerate mass region where all m; > \/Am?,,,,. In that region 
(mgg) > 0.1 eV, corresponding crudely to the Ov half-lives of 1026-27 
years. To explore it (in a realistic time frame), ~ 100 kg of the decay- 
ing nucleus is needed. Several experiments aiming at such sensitivity 
are being built and should run and give results within the next 3-5 
years. Moreover, this mass region (or a substantial part of it) will be 
explored, in a similar time frame, by the study of ordinary 6 decay 
(in particular of tritium) and by the observational cosmology. These 
techniques are independent on the Majorana nature of neutrinos. It is 
easy, but perhaps premature, to envision various scenarios depending 
on the possible outcome of these measurements. 

The so-called inverted hierarchy mass region where 20 < (mgg) < 100 
meV and the 0vG@ half-lives are about 1027728 years. (The name is 
to some extent a misnomer. In that interval one could encounter not 


only the inverted hierarchy but also a quasi-degenerate but normal 
neutrino mass ordering. Successful observation of the 0vG( decay will 
not be able to distinguish these possibilities, as I argued above. This 
is so not only due to the anticipated experimental accuracy, but more 
fundamentally due to the unknown Majorana phases.) To explore this 
mass region, ~ ton size sources would be required. Proposals for the 
corresponding experiments exist, but none has been funded as yet, and 
presumably the real work will begin depending on the experience with 
the various ~ 100 kg size sources. Timeline for exploring this mass 
region is ~ 10 years. 

e Normal mass hierarchy region where (mgg) < 10-20 meV. To explore 
this mass region, ~ 100 ton sources would be required. There are no 
realistic proposals for experiments of this size at present. 


Over the last two decades, the methodology for double beta decay ex- 
periments has improved considerably. Larger amounts of high-purity en- 
riched parent isotopes, combined with careful selection of all surrounding 
materials and using deep-underground sites have lowered backgrounds and 
increased sensitivity. The most sensitive experiments to date use “Ge, 
100Mo, 16Cd, Te, and Xe. For Ge the lifetime limit reached im- 


33,34 The experimental lifetime limits 


pressive values exceeding 10?°years. 
have been interpreted to yield effective neutrino mass limits typically a few 
eV and in Ge as low as 0.3 - 1.0 eV (the spread reflects an estimate of the 
uncertainty in the nuclear matrix elements). The sum electron spectrum 


obtained in the Heidelberg-Moscow*? experiment is shown in Fig. 10.11 
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Fig. 10.11. The spectrum recorded in the Heidelberg-Moscow ( decay experiment on 
T6Ge. Identified y lines are indicated. 


over a broad energy range, and in Fig. 10.12 over a narrower range in the 
vicinity of the Ov Q value of 2039 keV. Some residual natural radioactivity 
background lines are clearly visible in both figures, and no obvious peak at 
the Ov expected position can be seen in Fig. 10.12. 

Nevertheless, a subset of members of the Heidelberg-Moscow collab- 
oration reanalyzed the data (and used additional information, e.g. the 
pulse-shape analysis and a different algorithm in the peak search) and 
claimed to observe a positive signal corresponding to the effective mass of 
(mag) = 0.3975'54 eV.2° That report has been followed by a lively discus- 
sion. Clearly, such an extraordinary claim with its profound implications, 
requires extraordinary evidence. It is fair to say that a confirmation, both 
for the same “Ge parent nucleus, and better yet also in another nucleus 
with a different Q value, would be required for a consensus. In any case, 
if that claim is eventually confirmed, the degenerate mass scenario will be 
implicated, and eventual positive signal in the analysis of the tritium (@ 
decay and/or observational cosmology should be forthcoming. For the neu- 
trinoless 68 decay the next generation of experiments, which use ~ 100 kg 
of decaying isotopes will, among other things, test this recent claim. 

It is beyond the scope of these lecture notes to describe in detail the 
forthcoming 0v GG decay experiments. Rather detailed discussion of them 
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Fig. 10.12. Spectrum of the Heidelberg-Moscow experiment in the vicinity of the 0vGG 
decay value of 2039 keV. 


can be found e.g. in Ref. 16. Also, the corresponding chapter of the APS 
neutrino study*® has various details. Nevertheless, let me briefly comment 
on the most advanced of the forthcoming ~ 100 kg source experiments 
CUORE, GERDA, EXO, and MAJORANA. All of them are designed to ex- 
plore all (or at least most) of the degenerate neutrino mass region (mag) > 
0.1 eV. If their projected efficiencies and background projections are con- 
firmed, all of them plan to consider scaling up the decaying mass to ~ ton 
and extend their sensitivity to the “inverted hierarchy” region. 

These experiments use different nuclei as a source, “Ge for GERDA and 
MAJORANA, !°°Te for CUORE, and !°°Xe for EXO. The requirement of 
radiopurity of the source material and surrounding auxiliary equipment is 
common to all of them, as is the placement of the experiment deep under- 
ground to shield against cosmic rays. The way the electrons are detected 
is, however, different. While the germanium detectors with their superb 
energy resolution have been used for the search of the 0vGG decay for a 
long time, the cryogenic detectors in CUORE use the temperature increase 
associated with an event in the very cold TeO2 crystals, and in the EXO 
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experiment a Time Projection Chamber (TPC) uses both scintillation and 
ionization to detect the events. The EXO experiment in its final form (still 
under development and very challenging) would use a positive identification 
of the final Ba* ion as an ultimate background rejection tool. These four 
experiments are in various stages of funding and staging. First results are 
expected in about 3 years, and substantial results within 3-5 years in all 
of them. 


10.4. Nuclear matrix elements 


It follows from eq. (10.1) that (i) values of the nuclear matrix elements 
M°” are needed in order to extract the effective neutrino mass from the 
measured 0v33 decay rate, and (ii) any uncertainty in M°” causes a corre- 
sponding and equally large uncertainty in the extracted (mgg) value. Thus, 
the issue of an accurate evaluation of the nuclear matrix elements attracts 
considerable attention. 

To see qualitatively where the problems are, let us consider the so- 
called closure approximation, i.e. a description in which the second order 
perturbation expression is approximated as 


M” = (T fina [ÔO |V initial) . (10.18) 


Now, the challenge is to use an appropriate many-body nuclear model 
to describe accurately the wave functions of the ground states of the initial 
and final nuclei, |Winitiat) and |Y fina), as well as the appropriate form 
of the effective transition operator O©”) that describes the transforma- 
tion of two neutrons into two protons correlated by the neutrino prop- 
agator, and consistent with the approximations inherent to the nuclear 
model used. 

Common to all methods is the description of the nucleus as a system of 
nucleons bound in the mean field and interacting by an effective residual 
interaction. The used methods differ as to the number of nucleon orbits (or 
shells and subshells) included in the calculations and the complexity of the 
configurations of the nucleons in these orbits. The two basic approaches 
used so far for the evaluation of the nuclear matrix elements for both the 2v 
and Ov 3G decay modes are the Quasiparticle Random Phase Approxima- 
tion (QRPA) and the nuclear shell model (NSM). They are in some sense 
complementary; QRPA uses a larger set of orbits, but truncates heavily the 
included configurations, while NSM can include only a rather small set of 
orbits but includes essentially all possible configurations. NSM also can be 
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tested in a considerable detail by comparing to the nuclear spectroscopy 
data; in QRPA such comparisons are much more limited. 

For the 2v decay one can relate the various factors entering the cal- 
culations to other observables ( strength functions, cross sections of the 
charge-exchange reactions, etc.), accessible to the experiment. The consis- 
tency of the evaluation can be tested in that way. Of course, as pointed out 
above (see Fig. 10.9) the nuclear matrix elements for this mode are known 
anyway. Both methods are capable of describing the 2v matrix elements, at 
least qualitatively. These quantities, when expressed in natural units based 
on the sum rules, are very small. Hence their description depends on small 
components of the nuclear wave functions and is therefore challenging. In 
QRPA the agreement is achieved if the effective proton-neutron interaction 
coupling constant (usually called gpp) is slightly (by ~ 10 - 20 %) adjusted. 

The theoretical description for the more interesting 0v mode cannot 
use any known nuclear observables, since there are no observables directly 
related to the MO”. It is therefore much less clear how to properly esti- 
mate the uncertainty associated with the calculated values of M°”, and to 
judge their accuracy. Since the calculations using QRPA are much simpler, 
an overwhelming majority of the published calculations uses that method. 
There are suggestions to use the spread of these published values of M°” 
as a measure of uncertainty.” Following this, one would conclude that the 
uncertainty is quite large, a factor of three or as much as five. But that 
way of assigning the uncertainty is questionable. Using all or most of the 
published values of M°” means that one includes calculations of uneven 
quality. Some of them were devoted to the tests of various approximations, 
and concluded that they are not applicable. Some insist that other data, 
like the M?” , are correctly reproduced, other do not pay any attention to 
such test. Also, different forms of the transition operator O” are used, in 
particular some works include approximately the effect of the short range 
nucleon-nucleon repulsion, while others neglect it. 

In contrast, in Ref. 38 an assesment of uncertainties in the matrix el- 
ements M°” inherent in the QRPA was made, and it was concluded that 
with a consistent treatment the uncertainties are much less, perhaps only 
about 30% (see Fig. 10.13). That calculation uses the known 2v matrix el- 
ements in order to adjust the interaction constant mentioned above. There 
is a lively debate in the nuclear structure theory community, beyond the 
scope of these lectures, about this conclusion. 

It is of interest also to compare the resulting matrix elements of Rodin 
et al.28 based on QRPA and its generalizations, and those of the avail- 
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Fig. 10.13. Nuclear matrix elements and their variance for the indicated approximations 
(see Ref. 38). 


able most recent NSM evaluation.?? Note that the operators used in NSM 
evaluation do not include the induced nucleon currents that in QRPA re- 
duce the matrix element by about 30%. The QRPA*® and NSM? M°” are 
compared in Table 10.1. In the last column the NSM matrix elements are 
reduced by 30% to approximately account for the missing terms in the op- 
erator, and to make the comparison more meaningful. With this reduction, 
it seems that QRPA results are a bit larger in the lighter nuclei and a bit 
smaller in the heavier ones than the NSM results, but basically within the 
30% uncertainty estimate. Once the NSM calculations for the intermediate 
mass nuclei 9°Zr, !°°Mo and !!°Cd become available, one can make a more 
meaningful comparison of the two methods. 

When comparing the results shown in Table 10.1 as well as the results of 
other calculations (e.g. Refs. 40,41) with Fig. 10.9 it is important to notice 
a qualitative difference in the behaviour of the 2v and Ov matrix elements 
when going from one nucleus to another one. For 2v the matrix elements 
change rapidly, but for the Ov the variation is much more gentle (%°Zr is 
a notable exception, at least for QRPA). That feature, common to most 
calculations, if verified, would help tremendously in comparing the results 
or constraints from one nucleus to another one. 
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Table 10.1. Comparison of the cal- 
culated nuclear matrix elements M°” 
using the QRPA method*® and the 
NSM.?° In the last column the NSM 
values are reduced, divided by 1.3, to 
account approximately for the effects of 
the induced nucleon currents. 


Nucleus | QRPA | NSM | NSM/1.3 
Ge | 2.3-2.4 | 2.35 1.80 


82Se 1.9-2.1 | 2.26 1.74 
6 Zr 0.3-0.4 
100Mo | 1.1-1.2 
116Cq | 1.2-1.4 
130 Te 1.3 2.13 1.64 
136Xe | 0.6-1.0 | 1.77 1.36 


Once the nuclear matrix elements are fixed (by choosing your favorite set 
of results), they can be combined with the phase space factors (a complete 
list is available, e.g. in the monograph?) to obtain a half-life prediction for 
any value of the effective mass (mgg). It turns out that for a fixed (mgg) 
the half-lives of different candidate nuclei do not differ very much from each 
other (not more than by factors ~ 3 or so) and, for example, the boundary 
between the degenerate and inverted hierarchy mass regions corresponds to 
half-lives ~ 10?” years. Thus, the next generation of experiments, discussed 
above, should reach this region using several candidate nuclei, making the 
corresponding conclusions less nuclear model dependent. 


10.5. Neutrino magnetic moment and the distinction 
between Dirac and Majorana neutrinos 


Neutrino mass and magnetic moments are intimately related. In the or- 
thodox Standard Model neutrinos have a vanishing mass and magnetic mo- 
ments vanish as well. However, in the minimally extended SM containing 
gauge-singlet right-handed neutrinos the magnetic moment u, is nonvan- 
ishing and proportional to the neutrino mass, but unobservably small,** 


3eG F 
= —— m 
i V28r2 ” 
Here upg is the electron Bohr magneton, traditionally used as unit also for 


the neutrino magnetic moments. An experimental observation of a mag- 
netic moment larger than that given in eq. (10.19) would be an uneqivocal 


=3x 10- un . (10.19) 
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indication of physics beyond the minimally extended Standard Model. 

Laboratory searches for neutrino magnetic moments are typically based 
on the obsevation of the v — e scattering. Nonvanishing u, will be recog- 
nizable only if the corresponding electromagnetic scattering cross section 
is at least comparable to the well understood weak interaction cross sec- 
tion. The magnitude of u, (diagonal in flavor or transitional) which can be 
probed in this way is then given by 


1/2 
Pais Gas eer ek (—) , (10.20) 


HB = /2ra Me 
where T is the electron recoil kinetic energy. Considering realistic values 
of T, it would be difficult to reach sensitivities below ~ 1071! upg. Present 
limits are about an order of magnitude larger than that. 

Limits on u, can also be obtained from bounds on the unobserved energy 
loss in astrophysical objects. For sufficiently large u, the rate of plasmon 
decay into the vi pairs would conflict with such bounds. Since plasmons 
can also decay weakly into the vv pairs , the sensitivity of this probe is 
again limited by the size of the weak rate, leading to 


[ita] <i 

LB — Vra 
where wp is the plasmon frequency. Since usually (hwp)? < meT that 
limit is stronger than that given in eq. (10.20). Current limits on u, based 
on such considerations are ~ 107 t? upg. 

The interest in u, and its relation to neutrino mass dates from ~1990 
when it was suggested that the chlorine data on solar neutrinos show an 
anticorrelation between the neutrino flux and the solar activity character- 
ized by the number of sunspots. A possible explanation was suggested in 
Ref. 44 where it was proposed that a magnetic moment uy ~ 10700-1) up 
would cause a precession in solar magnetic field of the neutrinos emitted ini- 
tially as left-handed ve into unobservable right-handed ones. Even though 
later analyses showed that the correlation with solar acivity does not exist, 
the possibility of a relatively large uw, accompanied by a small mass m, was 


hwp , (10.21) 


widely discussed and various models accomplishing that were suggested. 
If a magnetic moment is generated by physics beyond the Standard 
Model (SM) at an energy scale A, we can generically express its value as 
m~ Ë, (10.22) 
where e is the electric charge and G contains a combination of coupling 
constants and loop factors. Removing the photon from the diagram gives 
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a contribution to the neutrino mass of order 
my ~ GA. (10.23) 
We thus arrive at the relationship 


Puw _ w 
2Me HB 10-1815 


[A(TeV)]? eV, (10.24) 


My ~ 


which implies that it is difficult to simultaneously reconcile a small neutrino 
mass and a large magnetic moment. 

This naive restriction given in eq. (10.24) can be overcome via a careful 
choice for the new physics, e.g., by requiring certain additional symme- 
tries. 50 Note, however, that these symmetries are typically broken by 
Standard Model interactions. For Dirac neutrinos such symmetry (under 
which the left-handed neutrino and antineutrino v and v° transform as a 
doublet) is violated by SM gauge interactions. For Majorana neutrinos 
analogous symmetries are not broken by SM gauge interactions, but are 
instead violated by SM Yukawa interactions, provided that the charged 
lepton masses are generated via the standard mechanism through Yukawa 
couplings to the SM Higgs boson. This suggests that the relation between 
Hv and m, is different for Dirac and Majorana neutrinos. This distinction 
can be, at least in principle, exploited experimentally, as shown below. 

Earlier, I have quoted the Ref. 17 (see Fig. 10.4) to stress that observa- 
tion of the 0vGG decay would necessarily imply the existence of a novanish- 
ing neutrino Majorana mass. Analogous considerations can be applied in 
this case. By calculating neutrino magnetic moment contributions to my 
generated by SM radiative corrections, one may obtain in this way general, 
“naturalness” upper limits on the size of neutrino magnetic moments by 
exploiting the experimental upper limits on the neutrino mass. 

In the case of Dirac neutrinos, a magnetic moment term will generically 
induce a radiative correction to the neutrino mass of order?! 


My v —  [A(TeV)?? eV. (10.25) 


Taking A ~ 1 TeV and m, < 0.3 eV, we obtain the limit u, < 107 yp 
(and a more stringent one for larger A), which is several orders of magnitude 
more constraining than current experimental upper limits on up. 

The case of Majorana neutrinos is more subtle, due to the relative flavor 
symmetries of m, and u, respectively. For Majorana neutrinos the transi- 
tion magnetic moments |j,],,g (the only possible ones) are antisymmetric 
in the flavor indices {a, 8}, while the mass terms [m,]ag are symmetric. 
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These different flavor symmetries play an important role in the limits, and 
are the origin of the difference between the magnetic moment constraints 
for Dirac and Majorana neutrinos. 

It has been shown in Ref 52 that the constraints on Majorana neutrinos 
are significantly weaker than those for Dirac neutrinos,°! as the different 
flavor symmetries of m, and u, lead to a mass term which is suppressed 
only by charged lepton masses. This conclusion was reached by considering 
one-loop mixing of the magnetic moment and mass operators generated 
by Standard Model interactions. The authors of Ref. 52 found that if a 
magnetic moment arises through a coupling of the neutrinos to the neutral 
component of the SU(2);, gauge boson, the constraints for re and Hry 
are comparable to present experiment limits, while the constraint on jie, is 
significantly weaker. Thus, the analysis of Ref. 52 lead to a bound for the 
transition magnetic moment of Majorana neutrinos that is less stringent 
than present experimental limits. 

Even more generally it was shown in Ref. 53 that two-loop matching 
of mass and magnetic moment operators implies stronger constraints than 
those obtained in°? if the scale of the new physics A > 10 TeV. Moreover, 
these constraints apply to a magnetic moment generated by either the hy- 
percharge or SU(2), gauge boson. In arriving at these conclusions, the 
most general set of operators that contribute at lowest order to the mass 
and magnetic moments of Majorana neutrinos was constructed, and model 
independent constraints which link the two were obtained. Thus the results 
of Ref. 53 imply completely model independent naturalness bound that — 
for A > 100 TeV - is stronger than present experimental limits (even for 
the weakest constrained element Heu). On the other hand, for sufficiently 
low values of the scale A the known small values of the neutrino masses 
do not constrain the magnitude of the transition magnetic moment u, for 
Majorana neutrinos more than the present experimental limits. Thus, if 
these conditions are fulfilled, the discovery of u, might be forthcoming any 
day. 

The above result means that an experimental discovery of a magnetic 
moment near the present limits would signify that (i) neutrinos are Majo- 
rana fermions and (ii) new lepton number violating physics responsible for 
the generation of u, arises at a scale A which is well below the see-saw scale. 
This would have, among other things, implications for the mechanism of 
the neutrinoless double beta decay and lepton flavor violation as discussed 
above and in Ref. 22. 
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10.6. Summary 


In these lectures I discussed the status of double beta decay, its relation to 
the charge conjugation symmetry of neutrinos and to the problem of the 
lepton number conservation in general. I have shown that if one makes 
the minimum assumption that the light neutrinos familiar from the oscilla- 
tion experiments which are interacting by the left-handed weak current are 
Majorana particles, then the rate of the OvGG decay can be related to the 
absolute scale of the neutrino mass in a straightforward way. 

On the other hand, it is also possible that the 0vGG decay is mediated 
by the exchange of heavy particles. I explained that if the corresponding 
mass scale of such hypothetical particles is ~ 1 TeV, the corresponding 
Ov decay rate could be comparable to the decay rate associated with the 
exchange of a light neutrino. I further argued that the study of the lepton 
flavor violation involving  — e conversion and u — e+ y decay may be 
used as a “diagnostic tool” that could help to decide which of the possible 
mechanisms of the Ov decay is dominant. 

Further, I have shown that the the range of the effective masses (mga) 
can be roughly divided into three regions of interest, each corresponding to a 
different neutrino mass pattern. The region of (mgg) > 0.1 eV corresponds 
to the degenerate mass pattern. Its exploration is well advanced, and one 
can rather confidently expect that it will be explored by several 3G decay 
experiments in the next 3-5 years. This region of neutrino masses (or most 
of it) is also accessible to studies using the ordinary 3 decay and/or the 
observational cosmology. Thus, if the nature is kind enough to choose this 
mass pattern, we will have a multiple ways of exploring it. 

The region of 0.01 < (mgg) < 0.1 eV is often called the ” inverted 
mass hierarchy” region. In fact, both the inverted and the quasi-degenerate 
but normal mass orderings are possible in this case, and experimentally 
indistinguishable. Realistic plans to explore this region using the 0vZG 
decay exist, but correspond to a longer time scale of about 10 years. They 
require much larger, ~ ton size G3 sources and correspondingly even more 
stringent background suppression. 

Finally, the region (mgg) < 0.01 eV corresponds to the normal hierarchy 
only. There are no realistic proposals at present to explore this mass region 
experimentally. 

Intimately related to the extraction of (mgg) from the decay rates is 
the problem of nuclear matrix elements. At present, there is no consensus 
among the nuclear theorists about their correct values, and the correspond- 
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ing uncertainty. I argued that the uncertainty is less than some suggest, 
and that the closeness of the Quasiparticle Random Phase Approximation 
(QRPA) and Shell Model (NSM) results are encouraging. But this is still 
a problem that requires further improvements. 

In the last part I discussed the neutrino magnetic moments. I have 
shown that using the Standard Model radiative correction one can calcu- 
late the contribution of the magnetic moment to the neutrino mass. That 
contribution, naturally, should not exceed the experimental upper limit on 
the neutrino mass. Using this procedure one can show that the magnetic 
moment of Dirac neutrinos cannot exceed about 10715 upg, which is several 
orders of magnitudes less than the current experimental limits on uy. On 
the other hand, due to the different symmetries of the magnetic moment 
and mass matrices for Majorana neutrinos, the corresponding constraints 
are much less restrictive, and do not exceed the current limits. Thus, a 
discovery of u, near the present experimental limit would indicate that 
neutrinos are Majorana particles, and the corresponding new physics scale 
is well below the GUT scale. 
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Supersymmetry in Elementary Particle Physics 


Michael E. Peskin 


Stanford Linear Accelerator Center, Stanford University 
2575 Sand Hill Road, Menlo Park, California 94025 USA 


These lectures give a general introduction to supersymmetry, empha- 
sizing its application to models of elementary particle physics at the 
100 GeV energy scale. I discuss the following topics: the construction of 
supersymmetric Lagrangians with scalars, fermions, and gauge bosons, 
the structure and mass spectrum of the Minimal Supersymmetric Stan- 
dard Model (MSSM), the measurement of the parameters of the MSSM 
at high-energy colliders, and the solutions that the MSSM gives to the 
problems of electroweak symmetry breaking and dark matter. 


11.1. Introduction 


11.1.1. Overview 


It is an exciting time now in high-energy physics. For many years, ever 
since the Standard Model was established in the late 1970’s, the next log- 
ical question in the search for the basic laws of physics has been that of 
the mechanism by which the weak interaction gauge symmetry is sponta- 
neously broken. This seemed at the time the one important gap that kept 
the Standard Model from being a complete theory of the strong, weak,and 
electromagnetic interactions [1-3]. Thirty years later, after many precision 
experiments at high-energy e*e~ and hadron colliders, this is still our sit- 
uation. In the meantime, another important puzzle has been recognized, 
the fact that 80% of the mass in the universe is composed of ‘dark mat- 
ter’, a particle species not included in the Standard Model. Both problems 
are likely to be solved by new fundamental interactions operating in the 
energy range of a few hundred GeV. Up to now, there is no evidence from 
particle physics for such new interactions. But, in the next few years, this 
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situation should change dramatically. Beginning in 2008, the CERN Large 
Hadron Collider (LHC) should give us access to physics at energies well 
above 1 TeV and thus should probe the energy region responsible for elec- 
troweak symmetry breaking. Over a longer term, we can look forward to 
precision experiments in e*e~ annihilation in this same energy region at 
the proposed International Linear Collider (ILC). 

Given this expectation, it is important for all students of elementary 
particle physics to form concrete ideas of what new phenomena we might 
find as we explore this new energy region. Of course, we have no way of 
knowing exactly what we will find there. But this makes it all the more 
important to study the alternative theories that have been put forward and 
to understand their problems and virtues. 

Many different models of new physics relevant to electroweak symmetry 
breaking are being discussed at this TASI school. Among these, supersym- 
metry has pride of place. Supersymmetry (or SUSY) provides an explicit 
realization of all of the aspects of new physics expected in the hundred 
GeV energy region. Because SUSY requires only weak interactions to build 
a realistic theory, it is possible in a model with SUSY to carry out ex- 
plicit calculations and find the answers that the model gives to all relevant 
phenomenological questions. 

In these lectures, I will give an introduction to supersymmetry as a con- 
text for building models of new physics associated with electroweak symme- 
try breaking. Here is an outline of the material: In Section 2, I will develop 
appropriate notation and then construct supersymmetric Lagrangians for 
scalar, spinor, and vector fields. In Section 3, I will define the canonical 
phenomenological model of supersymmetry, the Minimal Supersymmetric 
Standard Model (MSSM). I will discuss the quantum numbers of new par- 
ticles in the MSSM and the connection of the MSSM to the idea of grand 
unification. 

The remaining sections of these lectures will map out the phenomenol- 
ogy of the new particles and interactions expected in models of supersym- 
metry. I caution you that I will draw only those parts of the map that 
cover the simplest and most well-studied class of models. Supersymmetry 
has an enormous parameter space which contains many different scenarios 
for particle physics, more than I have room to cover here. I will at least try 
to indicate the possible branches in the path and give references that will 
help you follow some of the alternative routes. 

With this restriction, the remaining sections will proceed as follows: In 
Section 4, I will compute the mass spectrum of the MSSM from its param- 
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eters. I will also discuss the parameters of the MSSM that characterize 
supersymmetry breaking. In Section 5, I will describe how the MSSM pa- 
rameters will be measured at the LHC and the ILC. Finally, Section 6 will 
discuss the answers that supersymmetry gives to the two major questions 
posed at the beginning of this discussion, the origin of electroweak symme- 
try breaking, and the origin of cosmic dark matter. 

Although I hope that these lectures will be useful to students in studying 
supersymmetry, there are many other excellent treatments of the subject 
available. A highly recommended introduction to SUSY is the ‘Supersym- 
metry Primer’ by Steve Martin [6]. An excellent presentation of the for- 
malism of supersymmetry is given in the texbook of Wess and Bagger [7]. 
Supersymmetry has been reviewed at previous TASI schools by Bagger [8], 
Lykken [9], and Kane [10], among others. Very recently, three textbooks of 
phenomenological supersymmetry have appeared, by Drees, Godbole, and 
Roy [11], Binetruy [12], and Baer and Tata [13]. A fourth textbook, by 
Dreiner, Haber, and Martin [14], is expected soon. 

It would be wonderful if all of these articles and books used the same 
conventions, but that is too much to expect. In these lectures, I will use 
my own, somewhat ideosyncratic conventions. These are explained in Sec- 
tion 2.1. Where possible, within the philosophy of that section, I have 
chosen conventions that agree with those of Martin’s primer [6]. 


11.1.2. Motivation and Structure of Supersymmetry 


If we propose supersymmetry as a model of electroweak symmetry breaking, 
we might begin by asking: What is the problem of electroweak symmetry 
breaking, and what are the alternatives for solving it? 

Electroweak symmetry is spontaneously broken in the minimal form 
of the Standard Model, which I will refer to as the MSM. However, the 
explanation that the MSM gives for this phenomenon is not satisfactory. 
The sole source of symmetry breaking is a single elementary Higgs boson 
field. All mass of quarks, leptons, and gauge bosons arise from the couplings 
of those particles to the Higgs field. 

To generate symmetry breaking, we postulate a potential for the Higgs 
field 


V =p’ ly)? +All , (11.1) 


shown in Fig. 11.1. The assumption that u? < 0 is the complete explanation 
for electroweak symmetry breaking in the MSM. Since y is a renormaliz- 
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able coupling of this theory, the value of u cannot be computed from first 
principles, and even its sign cannot be predicted. 


V 


Fig. 11.1. The Standard Model Higgs potential (11.1). 


In fact, this explanation has an even worse feature. The parameter 
u? receives large additive radiative corrections from loop diagrams. For 
example, the two diagrams shown in Fig. 11.2 are ultraviolet divergent. 


Supplying a momentum cutoff A, the two diagrams contribute 


2 = 142 AÀ 42_ 3% 92 te 11.2 

H = Ebare t 8m2 B72 + (11.2) 
If we view the MSM as an effective theory, A should be taken to be the 
largest momentum scale at which this theory is still valid. The presence 
of large additive corrections implies that the criterion u? < 0 is not a 
simple condition on the underlying parameters of the effective theory. The 
radiative corrections can easily change the sign of y?. Further, if we insist 
that the MSM has a large range of validity, the corrections become much 
larger than the desired result. To obtain the Higgs field vacuum expectation 
value required for the weak interactions, |u| should be about 100 GeV. If 
we insist at the same time that the MSM is valid up to the Planck scale, 
A ~ 101° GeV, the formula (11.2) requires a cancellation between the bare 
value of u and the radiative corrections in the first 36 decimal places. This 
problem has its own name, the ‘gauge hierarchy problem’. But, to my mind, 
the absence of a logical explantion for electroweak symmetry breaking in 
the MSM is already problem enough. 

How could we solve this problem? There are two different strategies. 
One is to look for new strong-couplings dynamics at an energy scale of 1 TeV 
or below. Then the Higgs field could be composite and its potential could be 
the result, for example, of pair condensation of fermion constituents. Higgs 
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Fig. 11.2. Two Standard Model diagrams that give divergent corrections to the 
Higgs mass parameter u”. 


actually proposed that his field was a phenomenological description of a 
fermion pair condensation mechanism similar to that in superconductiv- 
ity [4]. Sometime later, Susskind [2] and Weinberg [3] proposed an explicit 
model of electroweak symmetry breaking by new strong interactions, called 
‘technicolor’. 

Today, this approach is disfavored. Technicolor typically leads to flavor- 
changing neutral currents at an observable level, and also typically conflicts 
with the accurate agreement of precision electroweak theory with experi- 
ment. Specific models do evade these difficulties, but they are highly con- 
strained [5]. 

The alternative is to postulate that the electroweak symmetry is broken 
by a weakly-coupled Higgs field, but that this field is part of a model in 
which the Higgs potential is computable. In particular, the Higgs mass 
term u?|p|? should be generated by well-defined physics within the model. 
A prerequisite for this is that the u? term not receive additive radiative 
corrections. This requires that, at high energy, the appearance of a nonzero 
u? in the Lagrangian should be forbidden by a symmetry of the theory. 

There are three ways to arrange a symmetry that forbids the term 
u?|y|?. We can postulate a symmetry that shifts 


dbp=evu. (11.3) 

We can postulate a symmetry that connects y to a gauge field, whose mass 
can then forbidden by gauge symmetry 

dp=e-A. (11.4) 


We can postulate a symmetry that connects y to a fermion field, whose 
mass can then be forbidden by a chiral symmetry. 


dbp=e-y. (11.5) 
The options (11.3) and (11.4) lead, respectively, to ‘little Higgs’ models [15- 


17] and to models with extra space dimensions [18,19]. The third option 
leads to supersymmetry. This is the route we will now follow. 
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The symmetry (11.5) looks quite innocent, but it is not. In quantum 
theory, a symmetry that links a boson with a fermion is generated by a 
conserved charge Qa that carries spin-1/2 


Qa, P] = a , [Qa H] =0. (11.6) 


Such a Qa implies the existence of a conserved 4-vector charge Rm defined 
by 


{Qa QI} = 272 Rm (11.7) 


(It may not be obvious to you that there is no Lorentz scalar component in 
this anticommutator, but I will show this in Section 2.1.) The charge Rm 
is conserved, because both Q and Qi commute with H. It is nonzero, as we 
can see by taking the expectation value of (11.7) in any state and setting 


a= 


(A| {Qa, QU} |A) = (A] QaQ} |A) + (A| Qa Q}, |A) 
= ||Qa | A) ||? + IQ} 14) ||? . (11.8) 


This expression is non-negative; it can be zero only if Qa and Qi, annihilate 
every state in the theory. 

However, in a relativistic quantum field theory, we do not have the 
freedom to introduce arbitrary charges that have nontrivial Lorentz trans- 
formation properties. Conservation of energy-momentum and angular mo- 
mentum are already very constraining. For example, in two-body scatter- 
ing, the scattering amplitude for fixed center of mass energy can only be a 
function of one variable, the center of mass scattering angle 0. If one adds 
a second conserved 4-vector charge, almost all values of 0 will also be for- 
bidden. Coleman and Mandula proved a stronger version of this statement: 
In a theory with an addtional conserved 4-vector charge, there can be no 
scattering at all, and so the theory is trivial [20]. 

If we would like to have (11.5) as an exact symmetry, then, the only 
possibility is to set Rm = Pm. That is, the square of the fermionic charge 
Qa must be the total energy-momentum of everything. We started out 
trying to build a theory in which the fermionic charge acted only on the 
Higgs field. But now, it seems, the fermionic charge must act on every field 
in the theory. Everything—quarks, leptons, gauge bosons, even gravitons— 
must have partners under the symmetry generated by Qa. Qa is fermionic 
and carries spin L. Then every particle in the theory must have a partner 
with the opposite statistics and spin differing by $ unit. 
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The idea that the transformation (11.5) leads to a profound general- 
ization of space-time symmetry was discovered independently several times 
in the early 1970’s [22,23]. The 1974 paper by Wess and Zumino [24] 
which gave simple linear realizations of this algebra on multiplets of fields 
launched the detailed exploration of this symmetry and its application to 
particle physics. 

The pursuit of (11.5) then necessarily leads us to introduce a very large 
number of new particles. This seems quite daunting. It might be a reason 
to choose one of the other paths, except that these also lead to new physics 
models of similarly high complexity. I encourage you to carry on with this 
line of analysis a bit longer. It will lead to a beautiful structure with many 
interesting implications for the theory of Nature. 


11.2. Formalism of Supersymmetry 


11.2.1. Fermions in 4 Dimensions 


To work out the full consequences of (11.5), we will need to write this 
equation more precisely. To do this, we need to set up a formalism that 
describes relativistic fermions in four dimensions in the most general way. 
There is no general agreement on the best conventions to use, but every 
discussion of supersymmetry leans heavily on the particular choices made. 
I will give my choice of conventions in this section. 

There are two basic spin-4 representations of the Lorentz group. Each 
is two-dimensional. The transformation laws are those of left- and right- 
handed Weyl (2-component) fermions, 


yr > (1 id 8/2- Ë- F/2) 4r 

Yr > (1—id- &/24+ 8-&/2) dp , (11.9) 
where @ is an infinitesimal rotation angle and 8 is an infinitesimal boost. 
The four-component spinor built from these ingredients, UV = (YL, YR), is 


a Dirac fermion. 
Define the matrix 


ee ee ee 
c= —io S o): (11.10) 


This useful matrix satisfies c? = —1, cT = —c. The combination 


Wircat = —€opViLaVP218 (11.11) 
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is the basic Lorentz invariant product of spinors. Many treatments of su- 
persymmetry, for example, that in Wess and Bagger’s book [7], represent c 
implicitly by raising and lowering of spinor indices. I will stick to this more 
prosaic approach. 

Using the identity c = —c(&)", it is easy to show that the quantity 
(—cyw7) transforms like wr. So if we wish, we can replace every Yr by 
a wy, and write all fermions in the theory as left-handed Weyl fermions. 
With this notation, for example, we would call ez and e} fermions and ep 
and ek antifermions. This convention does not respect parity, but parity 
is not a symmetry of the Standard Model. The convention of representing 
all fermions in terms of left-handed Weyl fermions turns out to be very 
useful for not only for supersymmetry but also for other theories of physics 
beyond the Standard Model. 

Applying this convention, a Dirac fermion takes the form 


v=( vız ) (11.12) 


—cey ar 


Write the Dirac matrices in terms of 2 x 2 matrices as 
y= P a (11.13) 
with 
o” =(1,® g” =(1,-8)” — co™ = (o")te (11.14) 
Then the Dirac Lagrangian can be rewritten in the form 
L£ = Vig : 0Y — MYY 
= yi ia -Owin + yiio - Oar 
—(mobi{peber z m*yircyžr) : (11.15) 


For the bilinears in the last line, we can use fermion anticommutation and 
the antisymmetry of c to show 


bipeber = +per - (11.16) 
and, similarly, 
(Tepr) = yiri = bl ebay - (11.17) 


The mass term looks odd, because it is fermion number violating. However, 
the definition of fermion number is that given in the previous paragraph. 
The fields #1, and wey annihilate, respectively, ez and ef. So this mass 
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term generates the conversion of e; to eg, which is precisely what we would 
expect a mass term to do. 

If we write all fermions as left-handed Weyl fermions, the possibilities for 
fermion actions are highly restricted. The most general Lorentz-invariant 
free field Lagrangian takes the form 


1 
L = Ppio - Op — 3 (man Cb = ms, pew) n (11.18) 


where j, k index the fermion fields. Here and in the rest of these lectures, I 
drop the subscript L. The matrix mjk is a complex symmetric matrix. For 
a Dirac fermion, 


m o0 


Mak = ( i ‘io (11.19) 
jk 
as we have seen in (11.15). This matrix respects the charge 


Qui = +41, Q2 = =y , (11.20) 


which is equivalent to the original Dirac fermion number. A Majorana 
fermion is described in the same formalism by the mass matrix 


Mik = mMôjk : (11.21) 


The most general fermion mass is a mixture of Dirac and Majorana terms. 
We will meet such fermion masses in our study of supersymmetry. These 
more general mass matrices also occur in other new physics models and in 
models of the masses of neutrinos. 

The SUSY charges are four-dimensional fermions. The minimum set 
of SUSY charges thus includes one Weyl fermion Qa and its Hermitian 
conjugate QÏ. We can now analyze the anticommutator {QanQ)}. Since 
the indices belong to different Lorentz representations, this object does not 
contain a scalar. The indices transform as do the spinor indices of o™, and 
so we can rewrite (11.7) with R™ = P™ as 


{Qa Q} = 26% Pa: (11.22) 


It is possible to construct quantum field theories with larger supersym- 
metry algebras. These must include (11.22), and so the general form is [21] 


(Q = 2P (11.23) 


for i,j = 1... N. This relation can be supplemented by a nontrivial anti- 
commutator 


{Qi QÍ} = 2eag Q” (11.24) 
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where the central charge QË is antisymmetric in [ij]. Theories with N > 4 
necessarily contain particles of spin greater than 1. Yang-Mills theory with 
N = 4 supersymmetry is an especially beautiful model with exact scale in- 
variance and many other attractive formal properties [25]. In these lectures, 
however, I will restrict myself to the minimal case of N = 1 supersymmetry. 

I will discuss supersymmetry transformations using the operation on 
fields 


Sed = [ET cQ + Qieé*, 8] . (11.25) 


Note that the operator ôg contains pairs of anticommuting objects and so 
obeys commutation rather than anticommutation relations. The operator 
Pm, acts on fields as the generator of translations, Pm = 10m. Using this, 
we can rewrite (11.22) as 


[de Ôn] = 2i (ETN — GE) Om (11.26) 


I will take this equation as the basic (anti)-commutation relation of super- 
symmetry. In the next two sections, I will construct some representations 
of this commutation relation on multiplets of fields. 


11.2.2. Supersymmetric Lagrangians with Scalars and Fermions 


The simplest representation of the supersymmetry algebra (11.26) directly 
generalizes the transformation (11.5) from which we derived the idea of 
supersymmetry. The full set of fields required includes a complex-valued 
boson field ¢ and a Weyl fermion field 7. These fields create and destroy 
a scalar particle and its antiparticle, a left-handed massless fermion, and 
its right-handed antiparticle. Note that the particle content has an equal 
number of fermions and bosons. This particle content is called a chiral 
supermultiplet. 

I will now write out the transformation laws for the fields corresponding 
to a chiral supermultiplet. It is convenient to add a second complex-valued 
boson field F that will have no associated particles. Such a field is called 
an auailiary field. We can then write the transformations that generalize 
(11.5) as 


deb = VET cy 
deb = V2io"cE* nd + VIF £ 
6¢F = -Vi T Om . (11.27) 
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The conjugates of these transformations are 


de® = -V 2y cg 
de! = V2iET co” Ind" + V2ETF* 
6¢F* = Viðm TE . (11.28) 


These latter transformations define the antichiral supermultiplet. I claim 
that the transformations (11.27) and (11.28), first, satisfy the fundamental 
commutation relation (11.26) and, second, leave a suitable Lagrangian in- 
variant. Both properties are necessary, and both must be checked, in order 
for a set of transformations to generate a symmetry group of a field theory. 

The transformation laws (11.27) seem complicated. You might won- 
der if there is a formalism that generates these relations automatically and 
manipulates them more easily than working with the three distinct com- 
ponent fields (ġ, Y, F). In the next section, I will introduce a formalism 
called superspace that makes it almost automatic to work with the chiral 
supermultiplet. However, the superspace description of the multiplet con- 
taining gauge fields is more complicated, and the difficulty of working with 
superspace becomes exponentially greater in theories that include gravity, 
higher dimensions, or N > 1 supersymmetry. At some stage, one must go 
back to components. I strongly recommend that you gain experience by 
working through the component field calculations described in these notes 
in full detail, however many large pieces of paper that might require. 

To verify each of the two claims I have made for (11.27) requires a little 
calculation. Here is the check of the commutation relation applied to the 
field ¢: 


[5¢, dn] = ðe(V 207 eb) — (E > n) 
= Vn" e(V2io” c£ np) — (E > n) 
= —2in" (5) E ane — (E = n) 
= 2i[€1o"'n — nT iano 
(11.29) 


The check of the commutation relation applied to F is equally straightfor- 
ward. The check on w is a bit lengthier. It requires a Fierz identity, that 
is, a spinor index rearrangment identity. Specifically, we need 


1 
Magy = —5('Tmn)oae » (11.30) 
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which you can derive by writing out the four components explicitly. After 
some algebra that involves the use of this identity, you can see that the 
SUSY commutation relation applied to vw also takes the correct form. 
Next, I claim that the Lagrangian 
L = 8" f Imo + pia - Op + F*F (11.31) 
is invariant to the transformation (11.27). I will assume that the Lagrangian 
(11.31) is integrated f d‘z and use integration by parts freely. Then 
eL = Ob" Iml VZET cp) + (—V 20 Wl cé*) Ob 
+ylio - A[V2io”" c&* 3mp + VEF] 
+[V2i0, "EF co” + V2 F* Jia - Ow 
4 F*[-V2iE T Om] + [V2Imyta | F 
= —G* V 2E COP L + V20nb* ET co T” On Om 
+V 20h! 07h — V2pta™ o” cE OmOnd 
+i T Om FE + Viðm TEF 
Vitt FO" Om) + VF ET Amy 
=0. (11.32) 


In the final expression, the four lines cancel line by line. In the first two 
lines, the cancellation is made by using the identity (a - ô) (o - 0) = 07. 

So far, our supersymmetry Lagrangian is just a massless free field theory. 
However, it is possible to add rather general interactions that respect the 
symmetry. Let W(¢) be an analytic function of ¢, that is, a function that 
depends on ¢ but not on ¢*. Let 


ow 1 oW 
Op 2 O¢? 


I claim that Cy is invariant to (11.27). Then we can add (Lw + Lie) 
to the free field Lagrangian to introduce interactions into the theory. The 
function W is called the superpotential. 

We can readily check that Lw is indeed invariant: 


Lw =F wr cy (11.33) 


Win ew 
ôsLw =F TE (VET ey) — V2FET cy Ip 
—V2iéto™ mae - iT eV Bock nd a 
w oes : (11.34) 


0¢? 
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The second line rearranges to 


oW Ow 
—V 21'S | Ony— + ¥0,¢— |, 11.35 
Vaigo (Av Ge + Vnd TE | (11.35) 
which is a total derivative. The third line is proportional to ~awewy, which 
vanishes by fermion antisymmetry since the spinor indices take only two 
values. Thus it is true that 


beLw =0. (11.36) 


The proofs of invariance that I have just given generalize straightfor- 
wardly to systems of several chiral supermultiplets. The requirement on 
the superpotential is that it should be an analytic function of the complex 
scalar fields ¢,. Then the following Lagrangian is supersymmetric: 


L = I" pk 3mpr + wlio - Ipp + Fi Fe + LW + Lh, , (11.37) 
where 
aw 1 ew 
=F T ; 11. 
Lw ead; 505 CUR IAIN (11.38) 


In this Lagrangian, the fields Fk are Lagrange multipliers. They obey the 
constraint equations 


Tea (11.39) 


Using these equations to eliminate the Fk, we find an interacting theory 
with the fields ¢, and Yk, a Yukawa coupling term proportional to the 
second derivative of W, as given in (11.38), and the potential energy 


(11.40) 


I will refer to Vr as the F-term potential. Later we will meet a second 
contribution Vp, the D-term potential. These two terms, both obtained by 
integrating out auxiliary fields, make up the classical potential energy of a 
general supersymmetric field theory of scalar, fermion, and gauge fields. 

The simplest example of the F-term potential appears in the theory 
with one chiral supermultiplet and the superpotential W = img’. The 
constraint equation for F is [27] 


F* = -mọ . (11.41) 
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After eliminating F, we find the Lagrangian 
1 
L = 0 Ong — |m|?¢*o4 pig - dy — alm ew —m*plep*) (11.42) 


This is a theory of two free scalar bosons of mass |m| and a free Majorana 
fermion with the same mass |m|. The Majorana fermion has two spin states, 
so the number of boson and fermion physical states is equal, as required. 

The form of the expression (11.40) implies that Vr > 0, and that Vr = 0 
only if all Fg = 0. This constraint on the potential energy follows from a 
deeper consideration about supersymmetry. Go back to the anticommuta- 
tion relation (11.22), evaluate it for a = 8, and take the vacuum expectation 
value. This gives 


(0 {Qa, Qh} |0) = (0| (H — P?) 0) = (0| H |0) , (11.43) 


since the vacuum expectation value of P? vanishes by rotational invariance. 
Below (11.7), I argued that the left-hand side of this equation is greater than 
or equal to zero. It is equal to zero if and only if 


Qa |0) = Qh |0) = 0 (11.44) 


The formulae (11.44) give the criterion than the vacuum is invariant under 
supersymmetry. If this relation is not obeyed, supersymmetry is sponta- 
neously broken. Taking the vacuum expectation value of the transformation 
law for the chiral representation, we find 


(0| [E7 cQ + Qteé*, We] |0) = (0| V2io” E" Onde + EF |0) 
= €(0| Fi, |0) - (11.45) 


In the last line I have used the fact that the vacuum expectation value of 
ọ(x) is translation invariant, so its derivative vanishes. The left-hand side 
of (11.45) vanishes if the vacuum state is invariant under supersymmetry. 

The results of the previous paragraph can be summarized in the fol- 
lowing way: If supersymmetry is a manifest symmetry of a quantum field 
theory, 


(0| H|0) =0, and (0| Fẹ |0) = 0 (11.46) 
for every F field of a chiral multiplet. In complete generality, 
(0| H 0) >0. (11.47) 


The case where (H) is positive and nonzero corresponds to spontaneously 
broken supersymmetry. If the theory has a state satisfying (11.44), this is 
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necesssarily the state in the theory with lowest energy. Thus, supersym- 
metry can be spontaneously broken only if a supersymmetric vacuum state 
does not exist® 

For the moment, we will work with theories that preserve supersym- 
metry. I will give examples of theories with spontaneous supersymmetry 
breaking in Section 3.5. 

The results we have just derived are exact consequences of the commu- 
tation relations of supersymmetry. It must then be true that the vacuum 
energy of a supersymmetric theory must vanish in perturbation theory. 
This is already nontrivial for the free theory (11.42). But it is correct. The 
positive zero point energy of the boson field exactly cancels the negative 
zero point energy of the fermion field. With some effort, one can show the 
cancellation also for the leading-order diagrams in an interacting theory. 
Zumino proved that this cancellation is completely general [29]. 

I would like to show you another type of cancellation that is also seen in 
perturbation theory in models with chiral fields. Consider the model with 
one chiral field and superpotential 


À 
W= 3? (11.48) 
After eliminating F, the Lagrangian becomes 


L = 06* Amo + wlio - Ob — Abb ep — Obl ep*) — Alet. (11.49) 


The vertices of this theory are shown in Fig. 11.3(a). 

From our experience in (11.2), we might expect to find an addditive 
radiative correction to the scalar mass. The corrections to the fermion and 
scalar mass terms are given by the diagrams in Fig. 11.3(b). Actually, there 
are no diagrams that correct the fermion mass; you can check that there 
it is not possible to match the arrows appropriately. For the scalar mass 
correction, the two diagrams shown contribute 


„2 f dpi 1 ; . a io-p io? -(- 
—4ià | aa Ral aiya) f Et | pir (=P) | 
(11.50) 


Using o - pa - p = p? in the second term and then taking the trace, we see 


that these two contributions cancel precisely. In this way, supersymmetry 
really does control radiative corrections to the Higgs mass, following the 
logic that we presented in Section 1.2. 


aJt is possible that a supersymmetric vacuum state might exist but that a higher-energy 
vacuum state might be metastable. A model built on this metastable state would show 
spontaneous breaking of supersymmetry [26]. 
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Fig. 11.3. Perturbation theory for the supersymmetric model (11.49): (a) vertices 
of the model; (b) corrections to the fermion and scalar masses. 


In fact, it can be shown quite generally that not only the mass term but 
the whole superpotential W receives no additive radiative corrections in any 
order of perturbation theory [30]. For example, the one-loop corrections to 
quartic terms in the Lagrangian cancel in a simple way that is indicated 
in Fig. 11.4. The field strength renormalization of chiral fields can be 
nonzero, so the form of W can be changed by radiative corrections by 
the rescaling of fields. Examples are known in which W receives additive 
radiative corrections from nonperturbative effects [31]. 
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Fig. 11.4. Scheme of cancellations of one-loop corrections to the F-term potential. 


11.2.3. Superspace 


Because the commutation relations of supersymmetry include the gener- 
ators of translations, supersymmetry is a space-time symmetry. It is an 
attractive idea that supersymmetry is the natural set of translations on 
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a generalized space-time with commuting and anticommuting coordinates. 
In this section, I will introduce the appropriate generalization of space-time 
and use it to re-derive some of the results of Section 2.2. 

Consider, then, a space with four ordinary space-time coordinates x” 
and four anticommuting coordinates (a, 0a). I will take the coordinates 
Oa to transform as 2-component Weyl spinors; the ĝa are the complex 
conjugates of the 0a. This is superspace. A superfield is a function of these 
superspace coordinates: ®(z, 0,0). 

It is tempting to define supersymmetry transformations as translations 
0 — 0 + £. However, this does not work. These transformations commute, 
[d¢,6y] = 0, and we have seen in Section 1.2 that this implies that the S- 
matrix of the resulting field theory must be trivial. To construct a set of 
transformations with the correct commutation relations, we must write 


Sð = QO , (11.51) 


where 


o = o 
Q= (-% = io" dn E+E (= + i7"60m (11.52) 
This is a translation of the fermionic coordinates (0, 0) plus a translation 
of the ordinary space-time coordinates proportional to 6, 0. It is straight- 
forward to show that these operators satisfy 


[Qe, Qn] = —2i (Ea — NTE) On . (11.53) 


Despite the fact that this equation has an extra minus sign on the right-hand 
side with respect to (11.26), it is the relation that we want. (The difference 
is similar to that between active and passive transformations.) Combined 
with the decomposition of the superfield that I will introduce below, this 
relation will allow us to derive the chiral supermultiplet transformation laws 
(11.27). 

Toward this goal, we need one more ingredient. Define the superspace 
derivatives 


ð = = ð 
pee Agee Dan o 11s 
D0, i(00"™) 40 ries Jað, (11.54) 
such that (D,®)' = Da®'. These operators commute with Qe: 


Thus, we can constrain ® by the equation 


Dað=0 o D,&=0, (11.56) 
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and these constraints are consistent with supersymmetry. What we have 
just shown is that the general superfield (x, 0,0) is a reducible represen- 
tation of supersymmetry. It can be decomposed into a direct sum of three 
smaller representations, one constrained by the first of the relations (11.56), 
one constrained by the second of these relations, and the third containing 
whatever is left over in ® when these pieces are removed. 

Let us begin with the constraint Da® = 0. The solution of this equation 
can be written 


(x, 0,0) = O(a + i100"0,0) , (11.57) 


that is, this solution is parametrized by a general function of x and @. Since 
0 is a two-component anticommuting object, this general function of x and 
0 can be represented as 


®(x,0) = p(x) + V207 cyx) + 0° OF (a) . (11.58) 


The field content of this expression is exactly that of the chiral supermul- 
tiplet. The supersymmetry transformation of this field should be 


de® = Oc O(x + 1000, 0) . (11.59) 


It is straightforward to compute the right-hand side of (11.59) in terms of 
0, 0, and the component fields of (11.58). The coefficients of powers of 0 are 
precisely the supersymmetry variations given in (11.27). Thus a superfield 


satisfying 
Da =0 (11.60) 


is equivalent to a chiral supermultiplet, and the transformation (11.59) gives 
the supersymmetry transformation of this multiplet. A superfield satisfying 
(11.60) is called a chiral superfield. Similarly, a superfield satisfying 


Da® =0 (11.61) 


is called an antichiral superfield. This superfield has a component field de- 
composition (¢*,~*, F*), on which Q¢ induces the transformation (11.28). 
I will describe the remaining content of the general superfield ® in Sec- 
tion 2.5. 

A Lagrangian on Minkowski space is integrated over dfx. A superspace 
Lagrangian should be also be integrated over the 0 coordinates. Integration 
over fermionic coordinates is defined to be proportional to the coefficient of 
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the highest power of 0. I will define integration over superspace coordinates 
by the formulae 


fen = f #90, =0 f ENE) =1 (11.62) 


and their conjugates. To use these formulae, expand the superfields in 
powers of 0 and pick out the terms proportional to (0Tc0). Then, if ® is a 
chiral superfield constrained by (11.60) and W(®) is an analytic function 
of ®, 


J 20 ®(2,9) = F(x) 


; 7 OW 1 p Ww 
fa 0 W(®) = Fa) ag i Yg (11.63) 


where, in the second line, W on the right-hand side is evaluated with ® = 
(x). With somewhat more effort, one can show 


peo fee BIS =O" HP Omo+ Wig - OW+ FF . (11.64) 


These formulae produce the invariant Lagrangians of chiral supermul- 
tiplets from a superspace point of view. The most general Lagrangian of 
chiral superfields ®; takes the form 


L= J torwa) + | eowa+ f ea wa) À (11.65) 


where W(®) is an analytic function of complex superfields and K(®, ®t) is 
a general real-valued function of the superfields. The Lagrangian (11.37) 
is generated from this expression by taking K(®, ®t) = Di Dp. The most 
general renormalizable Lagrangian of chiral supermultiplets is obtained by 
taking K to be of this simple form and taking W to be a polynomial of 
degree at most 3. 

Because the integral d?0 exposes the Lagrange multiplier F in (11.58), I 
will refer to a term with this superspace integral as an F-term. For similar 
reasons that will become concrete in the next section, I will call a term with 
a dt0 integral a D-term. 

In the remainder of these lectures, I will restrict myself to discussing 
renormalizable supersymmetric theories. But, still, it is interesting to ask 
what theories we obtain when we take more general forms for K. The 
Lagrangian for ¢ turns out to be a nonlinear sigma model for which the 
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target space is a complex manifold with the metric [32] 

z= -l ie 61) (11.66) 

Imn = BEmA|pin : i 

A complex manifold whose metric is derived from a potential in this way 
is called a Kähler manifold. The function K is the Kähler potential. It is 
remarkable that, wherever in ordinary quantum field theory we find a gen- 
eral structure from real analysis, the supersymmetric version of the theory 
has a corresponding complex analytic structure. 

Now that we have a Lagrangian in superspace, it is possible to derive 
Feynman rules and compute Feynman diagrams in superspace. I do not 
have space here to discuss this formalism; it is discussed, for example, 
in [7] and [30]. I would like to state one important consequence of this 
formalism. It turns out that, barring some special circumstances related to 
perturbation theory anomalies, these Feynman diagrams always generate 
corrections to the effective Lagrangian that are D-terms, 


J toxa) l (11.67) 


The perturbation theory does not produce terms that are integrals f d?0. 
This leads to an elegant proof of the result cited at the end of the pre- 
vious section that the superpotential is not renormalized at any order in 
perturbation theory [30]. 


11.2.4. Supersymmetric Lagrangians with Vector Fields 


To construct a supersymmetric model that can include the Standard Model, 
we need to be able to write supersymmetric Lagrangians that include Yang- 
Mills vector fields. In this section, I will discuss how to do that. 

To prepare for this discussion, let me present my notation for gauge 
fields in a general quantum field theory. The couplings of gauge bosons to 
matter are based on the covariant derivative, which I will write as 


Din = (Om — igAn th)? (11.68) 
for a field ¢ that belongs to the representation R of the gauge group G. In 
this formula, th are the representation matrices of the generators of G in 
the representation R. These obey 

ett] = af es (11.69) 


The coefficients f@° are the structure constants of G. They are independent 
of R; essentially, their values define the multiplication laws of G. They can 
be taken to be totally antisymmetric. 
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The generators of G transform under G according to a representation 
called the adjoint representation. I will denote this representation by R = 
G. Its representation matrices are 


(aust (11.70) 


These matrices satisfy (11.69) by virtue of the Jacobi identity. The covari- 
ant derivative on a field in the adjoint representation takes the form 


Dy B® = mD! + gf% AL O° (11.71) 


The field strengths F$ 


4, are defined from the covariant derivative (in any 


representation) by 


[Pins Dn] = —igF%,,t% - (11.72) 


mn 


This gives the familiar expression 
Fin = mAAR — On AG, + gf” ARAR - (11.73) 


Now we would like to construct a supersymmetry multiplet that contains 
the gauge field A?,. The fermion in the multiplet should differ in spin by 
l unit. To write a renormalizable theory, we must take this to be a spin-4 
Weyl fermion. I will then define the vector supermultiplet 


(Am Aa, D°) (11.74) 


including the gauge field, a Weyl fermion in the adjoint representation 
of the gauge group, and an auxililary real scalar field, also in the adjoint 
representation, that will have no independent particle content. The particle 
content of this multiplet is one massless vector boson, with two transverse 
polarization states, and one massless fermion and antifermion, for each 
generator of the gauge group. The fermion is often called a gaugino. The 
number of physical states is again equal between bosons and fermions. 
The supersymmetry transformations for this multiplet are 


A = [EITAS + Ate] 
eàt = fio" F2 p + DIE 
dal? = Et io” Fe, + D*] 
6¢D* = —i[216" DmA" — DAO] (11.75) 
where 


(o™a" —o0"G"™) . (11.76) 
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I encourage you to verify that these tranformations obey the algebra 
lóg, bn] = 2i (ETN — HOE) Om + ba , (11.77) 
where ôa is a gauge tranformation with the gauge parameter 
a= —2i(EtT n — nla™€) AZ, . (11.78) 


Acting on A“, the extra term ĝa in (11.77) can be combined with the trans- 
lation to produce the commutation relation 


[dg] A® = 2i G n— NTE) (Dm A). (11.79) 


This rearrangement applies also for the auxiliary field D° and for any matter 
field that tranforms linearly under G. The gauge field A%™” does not satisfy 
this last criterion; instead, we find 


le, 8n] Am = 26(€'a"n — nT” E (a nam — DmAn) 
= 2i(€1o"n — nlaré) Fe (11.80) 


The proof that (11.75) satisfies the supersymmetry algebra is more tedious 
than for (11.29), but it is not actually difficult. For the transformation of 
à% we need both the Fierz identity (11.30) and the relation 


naka — (En) = —(€Peopgn)(oP%e)ap - (11.81) 


The matrices o?%c and co”? are symmetric in their spinor indices. 

Again, the transformation laws leave a simple Lagrangian invariant. For 
the vector supermultiplet, this Lagrangian is that of the renormalizable 
Yang-Mills theory including the gaugino: 


1 1 
Ly =—F(Finn) + + ris. DA? + t(D (11.82) 


The kinetic term for D°“ contains no derivatives, so this field will be a 
Lagrange multiplier. 

The vector supermultiplet can be coupled to matter particles in chiral 
supermultiplets. To do this, we must first modify the transformation laws 
of the chiral supermultiplet so that the commutators of supersymmetry 
transformations obey (11.77) or (11.79). The modified transformation laws 
are: 


beh = V2ET ey 
deb = Vio” cE” Dne + V2FE 
eF = -vV ET Dm — gel Ato (11.83) 
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In this formula, the chiral fields ¢, Y, F must belong to the same rep- 
resentation of G, with t° a representation matrix in that representation. 
From the transformation laws, we can construct the Lagrangian. Start 
from (11.31), replace the derivatives by covariant derivatives, add terms 
to the Lagrangian involving the A% to cancel the supersymmetry variation 
of these terms, and then add terms involving D® to cancel the remaining 
supersymmetry variation of the A* terms. The result is 


Lp =D" d*Dnot Vti - Dy + F*F 
—V29q(b* AT t ey) _ wt cA t e) + gD%¢°t*¢ ; (11.84) 


The proof that this Lagrangian is supersymmetric, 6¢£ = 0, is completely 
straightforward, but it requires a very large sheet of paper. 

The gauge invariance of the theory requires the superpotential La- 
grangian Lw to be invariant under G as a global symmetry. Under this 
condition, Lw, which contains no derivatives, is invariant under (11.83) 
without modification. The combination of Lp, Lp, and Lw, with W a 
polynomial of degree at most 3, gives the most general renormalizable su- 
persymmetric gauge theory. 

As we did with the F field of the chiral multiplet, it is interesting to 
eliminate the Lagrange multiplier D®. For the Lagrangian which is the sum 
of (11.82) and (11.84), the equation of motion for D® is 


D° = —g¢*t*o . (11.85) 


Eliminating D® gives a second potential energy term proportional to (D*)?. 
This is the D-term potential promised below (11.40). I will write the result 
for a theory with several chiral multiplets: 


2 
Vo = 59° z sison) (11.86) 
k 


As with the F-term potential, Vp > 0 and vanishes if and only if D° = 0. 
It can be shown by an argument similar to (11.45) that 


(0| D° |0} =0 (11.87) 


unless supersymmetry is spontaneously broken. 

It makes a nice illustration of this formalism to show how the Higgs 
mechanism works in supersymmetry. For definiteness, consider a super- 
symmetric gauge theory with the gauge group U(1). 
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Introduce chiral supermultiplets ¢4, ¢_, and X, with charges +1, —1, 
and 0, respectively, and the superpotential 
W = X(o4¢_ —v")X . (11.88) 
The F = 0 equations are 


Fx =(¢4¢--v?)=0 FE=¢1X=0. (11.89) 


To solve these equations, set 


X=0 ġ=v/y b-=vy, (11.90) 
where y is a complex-valued parameter. The D = 0 equation is 
p e+- olo- =0. (11.91) 


This implies |y| = 1. So y is a pure phase and can be removed by a U(1) 
gauge transformation. 

Now look at the pieces of the Lagrangian that give mass to gauge bosons, 
fermions, and scalars. The gauge field receives mass from the Higgs mech- 
anism. To compute the mass, we can look at the scalar kinetic terms 


$\(—D?)b4 + 6! (—D?)b_ = +--+ Gh. (g?A?)b4 + GL (g?A?)d_ . (11.92) 
Putting in the vacuum expectation values 6, = ¢_ = v, we find 
m’ = 49v? (11.93) 


for the vector fields. The mode of the scalar field 
564 =n/ V2 s$- = -n/V2, (11.94) 


with 7 real, receives a mass from the D-term potential energy 


ae t t 2 
x (P4b+ — 6_9-) (11.95) 


Expanding to quadratic order in 7, we see that 7 also receives the mass 
m? = 4g?v?. The corresponding mode for 7 imaginary is the infinitesimal 
version of the phase rotation of y that we have already gauged away below 
(11.91). The mode of the fermion fields 


õp} =x/V2 öp- =-x/V2 (11.96) 
mixes with the gaugino through the term 


—V29(b) AT ew, — o AT cey_) + h.c. (11.97) 
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Putting in the vacuum expectation values ¢4 = ¢_ = v, we find a Dirac 
mass with the value 


m = 2gv (11.98) 


In all, we find a massive vector boson, a massive real scalar, and a massive 
Dirac fermion, all with the mass m = 2gv. The system has four physical 
bosons and four physical fermions, all with the same mass, as supersym- 
metry requires. 


11.2.5. The Vector Supermultiplet in Superspace 


The vector supermultiplet has a quite simple representation in superspace. 
This multiplet turns out to be the answer to the question that we posed in 
our discussion of superspace in the previous section: When the chiral and 
antichiral components of a general superfield are removed, what is left over? 
To analyze this issue, I will write a Lagrangian containing a local symmetry 
that allows us to gauge away the chiral and antichiral components of this 
superfield. Let V(x,0,0) be a real-valued superfield, acted on by a local 
gauge transformation in superspace 


ôV = S (A= at) (11.99) 
where A is a chiral superfield and A‘ is its conjugate. Since A satisfies 
(11.60), its expansion in powers of 0 contains 

A(x, 0,0) = A(x + i060, 0) = a(x) +--+ +106" 00na(x) +--+ (11.100) 
The general superfield V contains a term? 
V(a, 0,0) =--- +2060 Am(ax) +: (11.101) 


So the superfield V contains a space-time vector field Am(x), and under 
(11.99), Am transforms as 


ei, ðn (Rea) , (11.102) 


This is just what we would like for an Abelian gauge field. So we should 
accept (11.99) as the generalization of the Abelian gauge transformation to 
superspace. 

The real-valued superfield transforming under (11.99) is called a vector 
superfield. To understand its structure, use the gauge transformation to 


bThe factor 2 in this equation is convenient but disagrees with some standard treatments, 
e.g., [7]. 


634 Michael E. Peskin 


remove all components with powers of 0 or 6 only. This choice is called 
Wess-Zumino gauge [33]. What remains after this gauge choice is 


V(x,0,0) = 2000 Am (x) +20 67 c\ — 2070 cà" +020 D. (11.103) 
This expression has exactly the field content of the Abelian vector super- 
multiplet (Am, A, D). 

This gauge multiplet can be coupled to matter described by chiral su- 


perfields. For the moment, I will continue to discuss the Abelian gauge 
theory. For a chiral superfield ® with charge Q, the gauge transformation 


5® = iQA® (11.104) 


contains a standard Abelian gauge transformation with gauge parameter 
Rea(a) and also preserves the chiral nature of ®. Then the superspace 
Lagrangian 


[oat Piere p (11.105) 


is gauge-invariant. Using the representation (11.103) and the rules (11.62), 
it is straightforward to carry out the integrals explicitly and show that 
(11.105) reduces to (11.84), with t° = Q for this Abelian theory. 

We still need to construct the pure gauge part of the Lagrangian. To 
do this, first note that, because a quantity antisymmetrized on three Weyl 
fermion indices vanishes, 


DaD X=0 (11.106) 


for any superfield X. Thus, acting with D’ makes any superfield a chiral 
superfield. The following is a chiral superfield that also has the property 
that its leading component is the gaugino field A(x): 


1— 
Wa = —;D"(De)aV (11.107) 


Indeed, working this out in full detail, we find that Wa = Wa(x + i000, 0), 
with 


Walz, 0) = Aa + [(io"" Finn + DO), + 07 CO [mAic], (11.108) 


The chiral superfield Wa is the superspace analogue of the electromag- 
netic field strength. The Lagrangian 


1 
[ev zW” Ww (11.109) 
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reduces precisely to the Abelian version of (11.82). It is odd that the kinetic 
term for gauge fields is an F-term rather than a D-term. It turns out that 
this term can be renormalized by loop corrections as a consequence of the 
trace anomaly [34]. However, the restricted form of the correction has 
implications, both some simple ones that I will discuss later in Section 4.3 
and and more profound implications discussed, for example, in [35,36]. 

I will simply quote the generalizations of these results to the non-Abelian 
case. The gauge transformation of a chiral superfield in the representation 
R of the gauge group is 


oo ot ateHA | (11.110) 


where A® is a chiral superfield in the adjoint representation of G and t° is 
is the representation of the generators of G in the representation R. The 
gauge transformation of the vector superfield is 


BIN eh Ve (11.111) 
Then the Lagrangian 
|porno" (11.112) 


is locally gauge-invariant. Carrying out the integrals in the gauge (11.103) 
reduces this Lagrangian to (11.84). 
The form of the field strength superfield is rather more complicated than 
in the Abelian case, 
E aya ae 
Watt = =e Die t (De)ae V“! (11.113) 
g 
In Wess-Zumino gauge, this formula does reduce to the non-Abelian version 
of (11.108), 


Wel, 0) = A2 + [Go F8 n + D0), + 07 co [DmA**ig™c],, - (11.114) 


Then the Lagrangian 
[coupe] (11.115) 


reduces neatly to (11.82). 

The most general renormalizable supersymmetric Lagrangian can be 
built out of these ingredients. We need to put together the Lagrangian 
(11.115), plus a term (11.112) for each matter chiral superfield, plus a 
superpotential Lagrangian to represent the scalar field potential energy. 
These formulae can be generalized to the case of a nonlinear sigma model 
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on a Kahler manifold, with the gauge symmetry associated with an isometry 
of this target space. For the details, see [7]. 


11.2.6. R-Symmetry 


The structure of the general superspace action for a renormalizable theory of 
scalar and fermion fields suggests that this theory has a natural continuous 
symmetry. 

The superspace Lagrangian is 


c= | eouiwrew)+ | ioaten to f Powa fewo). 

(11.116) 
Consider first the case in which W (¢) contains only dimensionless param- 
eters and is therefore a cubic polynomial in the scalar fields. Then £ is 
invariant under the U (1) symmetry 


Pelz, 0) — e703, (a, e0),  V2(x,0,0) > V° (x, e0, e70) 
(11.117) 
or, in components, 


bk = Brea. , Wy ae telag l Nt > eit f (11.118) 


and the gauge fields are invariant. This transformation is called R- 
symmetry. Under R-symmetry, the charges of bosons and fermions differ 
by 1 unit, in such a way that that the gaugino and superpotential vertices 
have zero net charge. 

Since all left-handed fermions have the same charge under (11.118), the 
R-symmetry will have an axial vector anomaly. It can be shown that the 
R-symmetry current (of dimension 3, spin 1) forms a supersymmetry mul- 
tiplet together with the supersymmetry current (dimension 7, spin 3) and 
the energy-momentum tensor (dimension 4, spin 2) [37]. All three currents 
have perturbation-theory anomalies; the anomaly of the energy-momentum 
tensor is the trace anomaly, associated with the breaking of scalar invari- 
ance by coupling constant renormalization. The R-current anomaly is thus 
connected to the running of coupling constants and gives a useful formal 
approach to study this effect in supersymmetric models. 

It is often possible to combine the transformation (11.117) with other 
apparent U(1) symmetries of the theory to define a non-anomalous U(1) 
R-symmetry. Under such a symmetry, we will have 


&,(2,0) > e "O(a, e0), such that W(x,0) — e*W(a,e'6) . 
(11.119) 
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Such symmetries also often arise in models in which the superpotential has 
dimensionful coefficients. 

In models with extended, N > 1, supersymmetry, the R-symmetry 
group is also extended, to SU(2) for N = 2 and to SU(4) for N = 4 
supersymmetry. 


11.3. The Minimal Supersymmetric Standard Model 


11.3.1. Particle Content of the Model 


Now we have all of the ingredients to construct a supersymmetric general- 
ization of the Standard Model. To begin, let us construct a version of the 
Standard Model with exact supersymmetry. To do this, we assign the vec- 
tor fields in the Standard Model to vector supermultiplets and the matter 
fields of the Standard Model to chiral supermultiplets. 

The vector supermultiplets correspond to the generators of SU(3) x 
SU(2) x U(1). In these lectures, I will refer to the gauge bosons of these 
groups as A% , WS , and Bm, respectively. I will represent the Weyl fermion 
partners of these fields as 9°, W°, b. I will call these fields the gluino, wino, 
and bino, or, collectively, gauginos. In the later parts of these lectures, I 
will drop the tildes over the gaugino fields when they are not needed for 
clarity. 

I will assign the quarks and leptons to be fermions in chiral superfields. 
I will use the convention presented in Section 1.3 of considering left-handed 
Weyl] fermions as the basic particles and right-handed Wey] fermions as their 
antiparticles. In the Standard Model, the left-handed fields in a fermion 
generation have the quantum numbers 


z=(%) z a=) u d (11.120) 


The field € is the left-handed positron; the fields u, d are the left-handed 
antiquarks. The right-handed Standard Model fermion fields are the con- 
jugates of these fields. To make a generalization to supersymmetry, we will 
extend each of the fields in (11.120)—for each of the three generations—to 
a chiral supermultiplet. I will use the symbols 


be Se. ð ta d (11.121) 


to represent both the supermultiplets and the scalar fields in these multi- 
plets. Again, I will drop the tilde if it is unambiguous that I am referring 
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to the scalar partner rather than the fermion. The scalar particles in these 
supermultiplets are called sleptons and squarks, collectively, sfermions. 

What about the Higgs field? The Higgs field of the Standard Model 
should be identified with a complex scalar component of a chiral supermul- 
tiplet. But it is ambiguous what the quantum numbers of this multiplet 
should be. In the Standard Model, the Higgs field is a color singlet with 
I= Z, but we can take the hypercharge of this field to be either Y = +4 
or Y = -—ż4, depending on whether we take the positive hypercharge field 
or its conjugate to be primary. In a supersymmetric model, the choice 
matters. The superpotential is an analytic function of superfields, so it can 
only contain the field, not the conjugate. Then different Higgs couplings 
will be allowed depending on the choice that we make. 

The correct solution to this problem is to include both possibilities, That 
is, we include a Higgs supermultiplet with Y = +4 and a second Higgs 
supermultiplet with Y = —+. I will call the scalar components of these 


2 
multiplets H,, and Ha, respectively: 


Ht He 
H, = e ) H= (a (11.122) 


I will refer to the Weyl fermion components with these quantum numbers 
as hu, ħa. These fields or particles are called Higgsinos. 

I will argue below that it is necessary to include both Higgs fields in 
order to obtain all of the needed couplings in the superpotential. However, 
there is another argument. The axial vector anomaly of one U (1) and two 
SU (2) currents (Fig. 11.5) must vanish to maintain the gauge invariance 
of the model. In the Standard Model, the anomaly cancels nontrivially 
between the quarks and the leptons. In the supersymmetric generalization 
of the Standard Model, each Higgsino makes a nonzero contribution to this 
anomaly. These contributions cancel if we include a pair of Higgsinos with 
opposite hypercharge. 


SU(2) w(Y=5) 


SU(2) AA 


Fig. 11.5. The anomaly cancellation that requires two doublets of Higgs fields in 
the MSSM. 
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11.3.2. Grand Unification 


Before writing the Lagrangian in detail, I would like to point out that there 
is an interesting conclusion that follows from the quantum number assign- 
ments of the new particles that we have introduced to make the Standard 
Model supersymmetric. 

An attractive feature of the Standard Model is that the quarks and 
leptons of each generation fill out multiplets of the simple gauge group 
SU(5). This suggests a very beautiful picture, called grand unification, in 
which SU(5), or a group such as SO(10) or Ee for which this is a subgroup, 
is the fundamental gauge symmetry at very short distances. This unified 
symmetry will be spontaneously broken to the Standard Model gauge group 
SU(3) x SU(2) x U(1). 

For definiteness, I will examine the model in which the grand unified 
symmetry group is SU(5). The generators of SU(5) can be represented 
as 5 x 5 Hermitian matrices acting on the 5-dimensional vectors in the 
fundamental representation. To see how the Standard Model is embedded 
in SU(5), it is convenient to write these matrices as blocks with 3 and 2 
rows and columns. Then the Standard Model generators can be identified 


as 
sua): (" a sua): (° ail U(1) : (m5 2). 
(11.123) 


In these expressions, t° is an SU (3) generator, 07/2 is an SU(2) generator, 
and all of these matrices are normalized to tr[T4T?] = 484P. We should 
identify the last of these matrices with 3/5 Y. 

The symmetry-breaking can be caused by the vacuum expectation value 
of a Higgs field in the adjoint representation of SU(5). The expectation 
value 


(=V. a A (11.124) 


commutes with the generators in (11.123) and fails to commute with the 
off-diagonal generators. So this vacuum expectation value gives mass to the 
off-diagonal generators and breaks the gauge group to SU(3)x SU(2)xU(1). 

Matter fermions can be organized as left-handed Weyl fermions in the 
SU(5) representations 5 and 10. The 5 is the conjugate of the fundamental 
representation of SU (5); the 10 is the antisymmetric matrix with two 5 
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indices. 


oy 
D 
oedel 


(11.125) 


aoTAa Qa 


Yoo AQ Alal 
osres 


L 0 L 


It is straightforward to check that each entry listed has the quantum num- 
bers assigned to that field in the Standard Model. To compute the hy- 
percharges, we act on the 5 with (—1) times the hypercharge generator in 
(11.123), and we act on the 10 with the hypercharge generator on each 
index. This gives the standard results, for example, Y = +3 for the d and 
Y=-$+5 = for u and d. 

The SU (5) covariant derivative is 


Dm = (Om — igu AATA) , (11.126) 


where gy is the SU(5) gauge coupling. There is only room for one value 
here. So this model predicts that the three Standard Model gauge couplings 
are related by 


93 59 =J = JW, (11.127) 


where 


5 
93 = Js g2= 9 n= s . (11.128) 


Clearly, this prediction is not correct for the gauge couplings that we mea- 
sure in particle physics. 

However, there is a way to save this prediction. In quantum field theory, 
coupling constants are functions of length scale and change their values 
significantly from one scale to another by renormalization group evolution. 
It is possible that the values of g’, g, and gs that we measure could evolve 
at very short distances into values that obey (11.127). 

I will now collect the formulae that we need to analyze this question. 
Let 

Qa; = i (11.129) 
4n 
for i = 1,2,3. The one-loop renormalization group equations for gauge 
couplings are 
dgi = bi 
dlogQ (4r 


da; b; 
3 t a 11.130 


Supersymmetry in Elementary Particle Physics 641 


For U(1), the coefficient bı is 
2 3 1 3 
bi = y? Y 11.131 
1 3 > 5 $ 3 > 5 b> ( ) 


where the two sums run over the multiplets of left-handed Weyl fermions 
and complex-valued bosons. The factors Y? are the squares of the U(1) 
charges defined by (11.123). For non-Abelian groups, the expressions for 


the b coefficients are 


11 2 1 
b= -= 02(G) - 320l) = 2 Oln ; (11.132) 


where C2(G) and C(r) are the standard group theory coefficients. For 
SU(N), 


1 
Cx(G)=C(G)=N, C(N)= 3S (11.133) 
The solution of the renormalization group equation (11.130) is 
bi, Q 
oj=a My lee 
aT (Q) = aM) — = log = > (11.134) 


Now consider the situation in which the three couplings g; become equal 
at the mass scale My, the mass scale of SU(5) symmetry breaking. Let ay 
be the value of the a; at this scale. Using (11.134), we can then determine 
the Standard Model couplings at any lower mass scale. The three a;(Q) 
are determined by two parameters. We can eliminate those parameters and 
obtain the relation 


a3" = (1 + B)az* — Baz" (11.135) 
where 
bs — b2 
B= N 11.136 


The values of the a; are known very accurately at Q = mz [88]: 


az" = 8.504014 az'=29.5740.02 a;'=59.00+0.02. (11.137) 


Inserting these values into (11.135), we find 
B=0.716 + 0.005 + 0.03 . (11.138) 


In this formula, the first error is that propagated from the errors in 
(11.137) and the second is my estimate of the systematic error from neglect- 
ing the two-loop renormalization group coefficients and other higher-order 
corrections. 
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We can compare the value of B in (11.138) to the values of (11.136) from 
different models. The hypothesis that the three Standard Model couplings 
unify is acceptable only if the gauge theory that describes physics between 
mz and My gives a value of B consistent with (11.138). The minimal 
Standard Model fails this test. The values of the b; are 


4 
b3 = 11 — gng 
22 4 1 
TA ai 
4 1 


where ng is the number of generations and np, is the number of Higgs 
doublets. Notice that ng cancels out of (11.136). This is to be expected. 
The Standard Model fermions form complete representations of SU (5), and 
so their renormalization effects cannot lead to differences among the three 
couplings. For the minimal case np = 1 we find B = 0.53. To obtain a 
value consistent with (11.138), we need np, = 6. 

We can redo this calculation in the minimal supersymmetric version of 
the Standard Model. First of all, we should rewrite (11.132) for a super- 
symmetric model with one vector supermultiplet, containing a vector and 
a Weyl fermion in the adjoint representation, and a set of chiral supermul- 
tiplets indexed by k, each with a Weyl fermion and a complex boson. Then 
(11.132) becomes 


n= FCG) -FO - (F +5) Helm) 
k 
= 302(G) — S°C(rx) (11.140) 
k 


The formula (11.131) undergoes a similar rearrangement. Inserting the 
values of the C (rx) for the fields of the Standard Model, we find 


b3 =9- 21g 
1 
bz = 6 — 2ng — g h 
3 
b= —2ng — 07 (11.141) 
For the minimal Higgs content np, = 2, this gives 
5 
B= z= 0.714 (11.142) 


in excellent agreement with (11.138). 
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In Fig. 11.6, I show the unification relation pictorially. The three data 
points on the the left of the figure represent the measured values of the three 
couplings (11.137). Starting from the values of a; and a2, we can integrate 
(11.130) up to the scale at which these two couplings converge. Then we 
can integrate the equation for a3 back down to Q = mz and see whether 
the result agrees with the measured value. The lower set of curves presents 
the result for the Standard Model with np = 1. The upper set of curves 
shows the result for the supersymmetric extension of the Standard Model 
with n, = 2. This choice gives excellent agreement with the measured value 
of as. 


0.150 ic ™ ™T ™—T ™T ™—T 
ai(Q) 

0.125 

0.100 

0.075 
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0.025 


Caise kh aater ge l a 
0.00790 103 108 109 1012 1015 w8 Q (GeV) 


Fig. 11.6. Prediction of the SU (3) gauge coupling a; from the electroweak cou- 
pling constants using grand unification, in the Standard Model and in the MSSM. 


Actually, I slightly overstate the case for supersymmetry by ignoring 
two-loop terms in the renormalization group equations, and also by inte- 
grating these equations all the way down to mz even though, from searches 
at high-energy colliders, most of the squarks and gluinos must be heavier 
than 300 GeV. A more accurate prediction of a,(mz) from the electroweak 
coupling constants gives a slightly higher value, 0.13 instead of 0.12. How- 
ever, these corrections could easily be compensated by similar corrections 
to the upper limit of the integration, following the details of the particle 
spectrum at the grand unification scale. For a more detailed formal analy- 
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sis of these corrections, see [39], and for a recent evaluation of their effects, 
see [40]. It remains a remarkable fact that the minimal supersymmetric 
extension of the Standard Model is approximately compatible with grand 
unification ‘out of the box’, with no need for further model-building. 


11.3.3. Construction of the Lagrangian 


Now I would like to write the full Lagrangian of the minimal supersym- 
metric extension of the Standard Model, which I will henceforth call the 
MSSM. 

The kinetic terms and gauge couplings of the MSSM Lagrangian are 
completely determined by supersymmetry, the choice of the gauge group 
SU(3) x SU(2) x U(1), and the choice of the quantum numbers of the 
matter fields. The Lagrangian is a sum of terms of the forms (11.82) and 
(11.84). Up to this point, the only parameters that need to be introduced 
are the gauge couplings g1, g2, and g3. 

Next, we need a superpotential W. The superpotential is the source of 
nonlinear fermion-scalar interactions, so we should include the appropriate 
terms to generate the Higgs Yukawa couplings needed to give mass to the 
quarks and leptons. The appropriate choice is 


Wy = yi HantapQ) + yte Haacap li, — yit HuctopQ). (11.143) 


The notation for the quark and lepton multiplets is that in (11.120); the 
indices 7,7 = 1,2,3 run over the three generations. The indices a, 8 = 1,2 
run over SU(2) isospin indices. Notice that the first two terms require a 
Higgs field Hg with Y = —4, while the third term requires a Higgs field 
A, with Y = 5. If we leave out one of the Higgs multiplets, some quarks 
or leptons will be left massless. This is the second argument that requires 
two Higgs fields in the MSSM. 

I have written (11.143) including the most general mixing between left- 
and right-handed quarks and leptons of different generations. However, as 
in the minimal Standard Model, we can remove most of this flavor mixing by 
appropriate field redefinitions. The coupling constants yg, Ye, Yu are general 
3 x 3 complex-valued matrices. Any such matrix can be diagonalized using 
two unitary transformations. Thus, we can write 


Ya=WaYaVi = ye=WeYVI yu = WaYaVi , (11.144) 


with Wa and V, 3 x 3 unitary matrices and Y, real, positive, and diagonal. 
The unitary transformations cancel out of the kinetic energy terms and 
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gauge couplings in the Lagrangian, except that the W boson coupling to 
quarks is transformed 


gua” dwt > guia™(ViVadWe . (11.145) 


From this equation, we can identify (vi Vu) = Voxm, the Cabibbo- 
Kobayashi-Maskawa weak interaction mixing matrix. The Lagrangian term 
(11.143) thus introduces the remaining parameters of the Standard Model, 
the 9 quark and lepton masses (ignoring neutrino masses) and the 4 CKM 
mixing angles. The field redefinition (11.144) can also induce or shift a QCD 
theta parameter, so the MSSM, like the Standard Model, has a strong CP 
problem that requires an axion or another model-building solution [41]. 

There are several other terms that can be added to W. One possible 
contribution is a pure Higgs term 


W, = —uHaacap Hug - (11.146) 


The parameter u has the dimensions of mass, and consequently this mu 
term provides a supersymmetric contribution to the masses of the Higgs 
bosons. Because this term is in the superpotential, it does not receive ad- 
ditive raditive corrections. Even in a theory that includes grand unification 
and energies scale of the order of 1016 GeV, we can set the parameter u 
to be of order 100 GeV without finding this choice affected by large quan- 
tum corrections. We will see in Section 4.2 that the mu term is needed for 
phenomenological reasons. If u = 0, a Higgsino state will be massless and 
should have been detected already in experiments. It is odd that a theory 
whose fundamental mass scale is the grand unification scale should require 
a parameter containing a weak interaction mass scale. I will present some 
models for the origin of this term in Section 3.5. 

At this point, we have introduced two new parameters beyond those in 
the Standard Model. One is the value of u. The other is the result of the 
fact that we have two Higgs doublets in the model. The ratio of the Higgs 
vacuum expectation values 


(Hu) / (Ha) = tan 8 (11.147) 


will appear in many of the detailed predictions of the MSSM. 
There are still more superpotential terms that are consistent with the 
Standard Model gauge symmetry and quantum numbers. These are 


Wg = MeijkUid; dr T nodéaglaQs 
+73€apLale + Na€aplaHug - (11.148) 
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Here i, j, k are color indices, a, 8 are isospin indices, and arbitrary genera- 
tion mixing is also possible. These terms violate baryon and lepton number 
through operators with dimensionless coefficients. In constructing super- 
symmetric models, it is necessary either to forbid these terms by imposing 
appropriate discrete symmetries or to arrange by hand that some of the 
dangerous couplings are extremely small [42]. 

If baryon number B and lepton number L are conserved in a supersym- 
metric model, this model respects a discrete symmetry called R-parity, 


R= (= Prey « (11.149) 


Here (3B) is quark number and J is the spin of the particle. This quantity 
is constructed so that R = +1 on the particles of the Standard Model 
(including the Higgs bosons) and R = —1 on their supersymmetry partners. 
R acts differently on particles of different spin in the same supermultiplet, 
so R-parity is a discrete subgroup of a continuous R-symmetry. 

In a model with grand unification, there will be baryon number and 
lepton number violation, and so B and L cannot be used as fundamental 
symmetries. However, we can easily forbid most of the superpotential terms 
(11.148) by introducing a discrete symmetry that distinguishes the field Hy 
from the lepton doublets L;. A similar strategy can be used to forbid the 
first, 3-quark, term. With these additional discrete symmetries, the MSSM, 
including all other terms considered up to this point, will conserve R-parity. 


11.3.4. The Lightest Supersymmetric Particle 


If R-parity is conserved, the lightest supersymmetric particle will be abso- 
lutely stable. This conclusion has an important implication for the relation 
of supersymmetry to cosmology. If a supersymmetric particle is stable for 
a time longer than the age of the universe, and if this particle is electrically 
neutral, that particle is a good candidate for the cosmic dark matter. In 
Sections 6.3 and 6.4, I will discuss in some detail the properties of mod- 
els in which the lightest Standard Model superpartner is the dark matter 
particle. 

However, this is not the only possibility. Over times much longer than 
those of particle physics experiments—minutes, years, or billions of years— 
we need to consider the possibility that the lightest Standard Model super- 
partner will decay to a particle with only couplngs of gravitational strength. 
Complete supersymmetric models of Nature must include a superpartner of 
the graviton, a spin-3 particle called the gravitino. In a model with exact 
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supersymmetry, the gravitino will be massless, but in a model with spon- 
taneously broken supersymmetry, the gravitino acquires a mass through 
an analogue of the Higgs mechanism. If the supersymmetry breaking is 
induced by one dominant F-term, the value of this mass is [43] 


This expression is of the same order of magnitude as the expressions for 
Standard Model superpartner masses that I will give in Section 3.6. In 
string theory and other unified models, there may be additional Standard 
Model singlet fields with couplings of gravitation strength, called moduli, 
that might also be light enough that long-lived Standard Model superpart- 
ners could decay to them. 

Supersymmetric models with R-parity conservation and dark matter, 
then, divide into two classes, according to the identity of the lightest super- 
symmetric particle—the LSP. On one hand, the LSP could be a Standard 
Model superpartner. Cosmology requires that this particle is neutral. Sev- 
eral candidates are available, including the fermionic partners of the photon, 
Z°, and neutral Higgs bosons and the scalar partner of one of the neutrinos. 
In all cases, these particles will be weakly interacting; when they are pro- 
duced at high-energy colliders, they should not make signals in a particle 
detector. On the other hand, the LSP could be the gravitino or another 
particle with only gravitational couplings. In that case, the lightest Stan- 
dard Model superpartner could be a charged particle. Whether this particle 
is visible or neutral and weakly interacting, its decay should be included in 
the phenomenology of the model. 


11.3.5. Models of Supersymmetry Breaking 


There is still one important effect that is missing in our construction of the 
MSSM. The terms that we have written so far preserve exact supersym- 
metry. A fully supersymmetric model would contain a massless fermionic 
partner of the photon and a charged scalar particle with the mass of the 
electron. These particles manifestly do not exist. So if we wish to build a 
model of Nature with supersymmetry as a fundamental symmetry, we need 
to arrange that supersymmetry is spontaneously broken. 

From the example of spontaneous symmetry breaking in the Standard 
Model, we would expect to do this by including in the MSSM a field whose 
vacuum expectation value leads to supersymmetry breaking. This is not as 
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easy as it might seem. To explain why, I will first present some models of 
supersymmetry breaking. 

The simplest model of supersymmetry breaking is the O’Raifeartaigh 
model [44], with three chiral supermultiplets ¢o, 61, ¢2 interacting through 
the superpotential 


W = Ado + Mo142 + gog? - (11.151) 


This superpotential implies the F = 0 conditions 
0 = Fè =A+9¢? 
0 = FY = mġ2 + 29¢0¢1 
0 = FF = moi (11.152) 


The first and third equations contradict one another. It is impossible to 
satisfy both conditions, and so there is no supersymmetric vacuum state. 
This fulfils the condition for spontaneous supersymmetry breaking that I 
presented in Section 2.2. 

This mechanism of supersymmetry breaking has an unwanted corollary. 
Because one combination of the scalar fields appears in two different con- 
straints in (11.152), there must be an orthogonal combination that does not 
appear at all. This means that the F-term potential Vr has a surface of 
degenerate vacuum states. To see this explicitly, pick a particular vacuum 
solution 


go = 1 = $2 =0. (11.153) 


and expand the potential Vr about this point. There are 6 real-valued 
boson fields with masses 


0, 0, m, m, Vm?—2d4\g, Wm?2+2dAq. (11.154) 


These six fields do not pair into complex-valued fields; that is already an 
indication that supersymmetry is broken. The fermion mass term in (11.38) 
gives one Dirac fermion mass m and leaves one Weyl fermion massless. 
This massless fermion is the Goldstone particle associate with spontaneous 
supersymmetry breaking. 

A property of these masses is that the sum rule for fermion and boson 
masses 


str[m?] =X m; — m =0 (11.155) 


remains valid even when supersymmetry is broken. This sum rule is the 
coefficient of the one-loop quadratic divergence in the vacuum energy. Since 
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supersymmetry breaking does not affect the ultraviolet structure of the 
theory, this coefficient must cancel even if supersymmetry is spontaneously 
broken [45]. In fact, if Q is a conserved charge in the model, the sum rule 
is valid in each charge sector Q = q: 


strg[m?] =0. (11.156) 


In the O’Raifeartaigh model, supersymmetry is spontaneously broken 
by a nonzero expectation value of an F term. It is also possible to break 
supersymmetry with a nonzero expectation value of a D term. The D-term 
potential Vp typically has zeros. For example, in an SU(3) supersymmetric 
Yang-Mills theory, 

2 
1 mtani 
V= Ee ee og (11.157) 
3 3 

and it is easy to find solutions in which the terms in parentheses sum to 
zero. However, it is not difficult to arrange a Vr such that the solutions 
of the F = 0 conditions do not coincide with the solutions of the D = 0 
conditions. This leads to spontaneous symmetry breaking, again with the 
sum rule (11.156) valid at tree level. 

Unfortunately, the sum rule (11.156) is a disaster for the prospect of 
finding a simple model of spontaneously broken supersymmetry that ex- 
tends the Standard Model. For the charge sector of the d squarks, we 
would need all down-type squarks to have masses less than 5 GeV. For the 
charge sector of the charged leptons, we would need all sleptons to have 
masses less than 2 GeV. 


11.3.6. Soft Supersymmetry Breaking 


The solution to this problem is to construct models of spontaneously broken 
supersymmetry using a different strategy from the one that we use for elec- 
troweak symmetry breaking in the Standard Model. To break electroweak 
symmetry, we introduce a Higgs sector whose mass scale is the same as 
the scale of the fermion and gauge boson masses induced by the symmetry 
breaking. To break supersymmetry, however, we could introduce a new 
sector at a much higher mass scale, relying on a weak coupling of the new 
sector to the Standard Model particles to communicate the supersymme- 
try breaking terms. In principle, a weak gauge interaction could supply 
this coupling. However, the default connection is through gravity. Gravity 
and supergravity couple to all fields. It can be shown that supersymmetry 
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breaking anywhere in Nature is communicated to all other sectors through 
supergravity couplings. 

We are thus led to the following picture, which produces a phenomeno- 
logically reasonable supersymmetric extension of the Standard Model: We 
extend the Standard Model fields to supersymmetry multiplets in the man- 
ner described in Section 3.1. We also introduce a hidden sector with no di- 
rect coupling to quark, leptons, and Standard Model gauge bosons. Super- 
symmetry is spontaneously broken in this hidden sector. A weak interaction 
coupling the two sectors then induces a supersymmetry-breaking effective 
interaction for the Standard Model particles and their superpartners. If A 
is the mass scale of the hidden sector, the supersymmetry breaking mass 
terms induced for the Standard Model sector are of the order of 


2 
m~ a ~ > ; (11.158) 


where M is the mass of the particle responsible for the weak connection 
between the two sectors. M is called the messenger scale. By default, the 
messenger is supergravity. Then M = mp, and A ~ 10!! GeV. In this 
scenario, the superpartners acquire masses of the order of the parameter m 
in (11.158). 

It remains true that the quarks, leptons, and gauge bosons cannot ob- 
tain mass until SU(2) x U(1) is broken. It is attractive to think that 
the symmetry-breaking terms that give mass to the superpartners cause 
SU(2) x U(1) to be spontaneously broken, at more or less the same scale. 
I will discuss a mechanism by which this can happen in Section 6.1. The 
weak interaction scale would then not be a fundamental scale in Nature, but 
rather one that arises dynamically from the hidden sector and its couplings. 

The effective interaction that are generated by messenger exchange gen- 
erally involve simple operators of low mass dimensions, to require the min- 
imal number of powers of M in the denominator. These operators are soft 
perturbations of the theory, and so we say that the MSSM is completed by 
including soft supersymmetry-breaking interactions. 

However, the supersymmetry-breaking terms induced in this model will 
not include all possible low-dimension operators. Since these interactions 
arise by coupling into a supersymmetry theory, they are formed by starting 
with a supersymmetric effective action and turning on F and D expectation 
values as spurions. Only a subset of the possible supersymmetry-breaking 
terms can be formed in this way [46]. By replacing a superfield ® by 
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67 cO (F), we can convert 
[HOKE > moto 
J dO f(D)WTceW > mAT cà 
fe W(6, ¢) > BP + Ag? (11.159) 


However, as long as the ¢ theory is renormalizable, we cannot generate the 
terms 


my cp, Cé*¢?, (11.160) 


by turning on expectation values for F and D fields. Thus, we cannot 
generate supersymmetry-breaking interactions that are mass terms for the 
fermion field of a chiral multiplet or non-holomorphic cubic terms for the 
scalar fields. 

There is another difficulty with terms of the form (11.160). In models 
with Standard Model singlet scalar fields, which typically occur in concrete 
models, these two interactions can generate new quadratic divergences when 
they appear in loop diagrams [46]. 

Here is the most general supersymmetry-breaking effective Lagrangian 
that can be constructed following the rule just given that is consistent with 
the gauge symmetries of the Standard Model: 


~o 1 
Lsoft = —M?| f? — FAT CA? 


= (AayadHao€opQ a + AcYetHankap lp 
— AvyuttHatosQe — BuHaa€ogHug) — h.c. (11.161) 


I have made the convention of scaling the A terms with the corresponding 
Yukawa couplings and scaling the B terms with u. The parameters A and 
B then have the dimensions of mass and are expected to be of the order of 
m in (11.158). 

For most of the rest of these lectures, I will represent the effects of the 
hidden sector and supersymmetry breaking simply by adding (11.161) to 
the supersymmetric Standard Model. I will then consider the MSSM to be 
defined by 


L= Lr + Lp + Lw + Lsoft (11.162) 
combining the pieces from (11.82), (11.84), (11.143), (11.146), and (11.161). 
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There are two problems with this story. The first is the w term in 
the MSSM superpotential. This a supersymmetric term, and so u can 
be arbitrarily large. To build a successful phenomenology of the MSSM, 
however, we need to have u of the order of the weak scale. Ideally, u should 
be parametrically equal to (11.158). 

There are simple mechanisms that can solve this problem. A fundamen- 
tal theory that leads to the renormalizable Standard Model at low energies 
can also contain higher-dimension operators suppressed by the high-energy 
mass scale. Associate this scale with the messenger scale. Then a super- 
symmetric higher-dimension operator in the superpotential 


1 
fè yS Hu (11.163) 


leads to a u term if S' acquires a vacuum expectation value. If S is a hidden 
sector field, we could find [47] 
(S?) A? 
= — n — 11.164 
=M M’ ee) 


A supersymmetric higher dimension contribution to the Kahler potential 
1 
d0 —6' HyH, 11.165 
Jog (11.165) 


leads to a u term if ® acquires a vacuum expectation value in its F term. 
If ® is a hidden sector field, we could find [48] 
(Fə A? 

ar ag? (11.166) 
In models with weak-coupling dynamics, higher-dimension operators are 
associated with the string or Planck scale; then, these mechanisms work 
most naturally if supergravity is the mediator. However, it is also possible 
to apply these strategies in models with strong-coupling dynamics in the 
hidden sector at an intermediate scale. 

Generating the u term typically requires breaking all continuous R- 
symmetries of the model. This is unfortunate, because an R-symmetry 
might be helpful phenomenologically, for example, to keep gaugino masses 
small while allowing sfermion masses to become large, or because it might 
be difficult to break an R-symmetry using a particular explicit mechanism 
of supersymmetry breaking. In this case, it is necessary to add Standard 
Model singlet fields to the MSSM to allow all gaugino and Higgsino fields 
to acquire nonzero masses. Models of this type are presented in [49,50]. 
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The second problem involves the flavor structure of the soft supersym- 
metry breaking terms. In writing (11.161), I did not write flavor indices. 
In principle, these terms could have flavor-mixing that is arbitrary in struc- 
ture and different from that in (11.143). Then the flavor-mixing would not 
be transformed away when (11.143) is put into canonical form. However, 
flavor-mixing from the soft supersymmetry breaking terms is highly con- 
strained by experiment. Contributions such as the one shown in Fig. 11.7 
give contributions to K°, D°, and B° mixing, and to T > py and u > ey, 
that can be large compared to the measured values or limits. Theories of 
the origin of the soft terms in models of supersymmetry breaking should 
address this problem. For example, the models of gauge-mediated [52] and 
anomaly-mediated [53,54] supersymmetry breaking induce soft terms that 
depend only on the SU(2) x U(1) quantum number and are therefore auto- 
matically diagonal in flavor. A quite different solution, based on a extension 
of the MSSM with a continuous R-symmetry, is presented in [51]. 


Fig. 11.7. A dangerous contribution to K-K mixing involving gluino exchange 
and flavor mixing in the squark mass matrix. 


If I assume that the soft supersymmetry-breaking Lagrangian is diagonal 
in flavor but is otherwise arbitrary, it introduces 22 new parameters. With 
arbitrary flavor and CP violation, it introduces over 100 new parameters. 
This seems a large amount of parameter freedom. I feel that it is not correct, 
though, to think of these as new fundamental parameters in physics. The 
soft Lagrangian is computed from the physics of the hidden sector, and 
so we might expect that these parameters are related to one another as a 
part of a theory of supersymmetry breaking. Indeed, the values of these 
parameters are the essential data from which we will infer the properties of 
the hidden sector and its new high energy interactions. 

If supersymmetry is discovered at the weak interaction scale, it will be 
a key problem to measure the coefficients in the soft Lagrangian and to 
understand their pattern and implications. Most of my discussion in the 
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next two sections will be devoted to the question of how the soft parameters 
can be determined from data at the LHC and ILC. 


11.4. The Mass Spectrum of the MSSM 


11.4.1. Sfermion Masses 


Our first task in this program is to ask how the parameters of the MSSM 
Lagrangian are reflected in the mass spectrum of the superparticles. The 
relation between the MSSM parameters and the particle masses is surpris- 
ingly complicated, even at the tree level. For each particle, we will need to 
collect all of the pieces of the Lagrangian (11.162) that can contribute to 
the mass term. Some of these will be direct mass contributions; others will 
contain Higgs fields and contribute to the masses when these fields obtain 
their vacuum expectation values. In this discussion, and in the remainder 
of these lectures, I will ignore all flavor-mixing. 

Begin with the squark and slepton masses. For light quarks and leptons, 
we can ignore the fermion masses and Higgs couplings. Even with this 
simplification, though, there are two sources for the scalar masses. One is 
the soft mass term 


Lsoft = —M3|f/? . (11.167) 


The other comes from the D-term potential. The SU(2) and U(1) po- 
tentials contain the cross terms between the Higgs field and sfermion field 


contributions 
g? 10° oO ~ os 
Vite (Haz Hat Hiz Au) (FEP 
g? 1 i 1 i P 
+ 5 -2-( 5 HaHa | 5 HuHu) -(f*Yf). (11.168) 


To evalute this expression, we must insert the vacuum expectation values 
of the two Higgs fields. In terms of the angle 8 defined in (11.147), these 


are 
a=] E ) (Ha) = Ca í (11.169) 


Va” sin 3 


where v = 246 GeV so that my = gu/2. 
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Inserting the Higgs vevs into the potential (11.168), we find 


= 2 7 
Vo = F [7 (cos? 8 — sin? 8)(9°? — g°Y)] f 


f ig +a" er cos 28 (I? — 81 +Y))] f 


JF 
F 


II 


[m3 cos 28(I° — s2,Q)] F . (11.170) 


Then, if we define 
Ay = (I? — s2,Q) cos26 mz, , (11.171) 
the mass of a first- or second-generation sfermion takes the form 
my =M; + Ay (11.172) 


when contributions proportional to fermion masses can be neglected. The 
D-term contribution can have interesting effects. For example, SU(2) in- 
variance of M? implies that 


m?(€) — m’ (T) = |cos28| m} >0. (11.173) 


For some choices of parameters, the measurement of this mass difference is 
a good way to determine tan 8 [55]. 
For third-generation fermions, the contributions to the mass term from 


Yukawa couplings and from A terms can be important. For the b and b, 
these contributions come from the terms in the effective Lagrangian 


Fol? + LF5I? = lye (H9) 6)? + luob (HG) |? = mf (l? + 0P) 
|\Fxa|? = (—u (H9Y)* (ubb) + h.c. = — um, tan 8 bb + h.c. 
—Lsoft = Asy (H?) bb = Aymybd . (11.174) 


In all, we find a mass matrix with mixing between the two scalar partners 
of the b quark, 
See ee ae 
(© 7 ) Me ae (11.175) 


2_ MÈ + Ay +m? my(A, — u tan 3) 
my(Ap _ utan 8) Mz + A; + me 


with 


(11.176) 


The mass matrix for 7, 7 has the same structure. For f, T, replace tan 3 by 
cot 2. 
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The mixing terms in the mass matrices of the third-generation sfermions 
often play an important role in the qualitative physics of the whole SUSY 
model. Because of the mixing, one sfermion eigenstate is pushed down in 
mass. This state is often the lightest squark or even the lightest superpar- 
ticle in the theory. 


11.4.2. Gaugino and Higgsino Masses 


In a similar way, we can compute the mass terms for the gauginos and Hig- 
gsinos. Since the gauginos and Higgsino have the same quantum numbers 
after SU(2) x U(1) breaking, they will mix. We have seen in Section 2.4 
that this mixing plays an essential role in the working of the Higgs mech- 
anism in the limit where soft supersymmetry breaking terms are turned 
off. 

The charged gauginos and Higgsinos receive mass from three sources. 
First, there is a soft SUSY breaking term 


—Leoft = mo cit . (11.177) 
The pu superpotential term contributes 
-Lw = phy? cht . (11.178) 
The gauge kinetic terms contribute 
-L= v3 ((Hi) wT chy + (HL) @T cht) (11.179) 


Inserting the Higgs field vevs from (11.169), we find the mass term 


(aT hg? )eme (3i) l (11.180) 


with 


= mə V2mw sin 8 
mc = C i ) f (11.181) 


The mass matrix for neutral gauginos and Higgsinos also receives con- 
tributions from these three sources. In this case, all four of the states 


(b, ©, h9, h?) (11.182) 
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have the same quantum numbers after SU(2) x U(1) breaking and can mix 
together. The mass matrix is 


my 0 —MzZCgSy MZSBSw 
meS 0 mə MZCBCw —MzZSBCw (11.183) 
=MZCBSw MZCBCw 0 —p 
MZSBSw —MZSBCw —p 0 


The mass eigenstates in these systems are referred to collectively as 
charginos and neutralinos. The matrix (11.183) is complex symmetric, so 
it can be diagonalized by a unitary matrix Vo,° 


my =VoeDnV¢ . (11.184) 
I will denote the neutralinos as N°, i = 1,...,4, in order of mass with N? 
the lightest. Elsewhere in the literature, you will see these states called 


Xe or Zo . The mass eigenstates are related to the weak eigenstates by the 
transformation 


P N: 
we Nə 
AE V AS (11.185) 
no Na 


Note that the diagonal matrix Dy in (11.184) may have negative or 
complex-valued elements. If that is true, the physical fermion masses of 
the Ñ; are the absolute values of the corresponding elements of Dy. The 
phases will appear in the three-point couplings of the Ñ; and can lead to 
observable interference effects. Complex phases in Dy would provide a new 
source of CP violation. 

The chargino mass matrix (11.181) is not symmetric, so in general it is 
diagonalized by two unitary matrices 


meo = V*DoVİ . (11.186) 


I will denote the charginos as CF, i = 1,2, in order of mass with Ce the 
lighter. Elsewhere in the literature, you will see these states called x; 


or W7. The mass eigenstates are related to the weak eigenstates by the 


transformation 


(s)-v(B), (E)E) a 


©Note that this formula is different from that which diagonalizes a Hermitian matrix. A 
detailed discussion of the diagonalization of mass matrices appearing in SUSY can be 
found in the Appendix of [56]. 
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It should be noted that u are must be nonzero. If u = 0, the determi- 
nant of (11.183) vanishes and so the lightest neutralino must be massless. 
This neutralino will also have a large Higgsino content and thus an order-1 
coupling to the Z°. It is excluded by searches for an excess of invisible Z° 
decays and for Z°? > Ni No. The condition u = 0 also implies that the 
lightest chargino has a mass below the current limit of about 100 GeV. 

Often, one studies models for which m1, me, and yu are all large com- 
pared to mw and mz. The off-diagonal elements that mix the gaugino 
and Higgsino states are of the order of mw and mz. Thus, if the scale of 
masses generated by the SUSY breaking terms is large, the mixing is small 
and the individual eigenstates are mainly gaugino or mainly Higgsino. How- 
ever, there are two distinct cases. The first is the gaugino region, where 
Mı, Mo < ||. In this region of parameter space, the lightest states NÑ, & 
are mainly gaugino, while the heavy neutralinos and charginos are mainly 
Higgsino. In the Higgsino region, mı, mz > |u|, the situation is reversed 
and M, Č; are mainly Higgsino. In this case, the two lightest neutralinos 
are almost degenerate. In Fig. 11.8, I show the mass eigenvalues as a func- 
tion of the mass matrix parameters along a line in the parameter space on 
which the NA has a fixed mass of 100 GeV. As we will see in Section 6.4, 
the exact makeup of the lightest neutralino as a mixture of gaugino and 
Higgsino components is important to the study of supersymmetric dark 
matter. 

To summarize this discussion, I present in Fig. 11.9 the complete spec- 
trum of new particles in the MSSM at a representative point in its param- 
eter space. Notice that the third-generation sfermions are split off from 
the others in each group. Note also that the parameter point chosen is in 
the gaugino region. The lightest superparticle is the Nj. I will discuss the 
spectrum of Higgs bosons in Section 6.2. 


11.4.3. Renormalization Group Evolution of MSSM Parameters 


The spectrum shown in Fig. 11.9 appears to have been generated by assign- 
ing random values to the soft SUSY breaking parameters. But, actually, 
I generated this spectrum by making very simple assumptions about the 
relationships of the soft parameters, at a high energy scale. Specifically, I 
assumed that the soft SUSY breaking gaugino masses and (separately) the 
sfermion masses were equal at the scale of grand unification. The struc- 
ture that you see in the figure is generated by the renormalization group 
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Fig. 11.8. Masses of the four neutralinos and two charginos along a line in the 
SUSY parameter space on which m(N? ) = 100 GeV while the parameter u moves 
from large negative to large positive values. The parameter mı is set to mı = 
0.5m2. Note the approximate degeneracies in the extreme limits of the gaugino 
and Higgsino regions. 


evolution of these parameters from the grand unification scale to the weak 
scale. 

The renomalization group (RG) evolution of soft parameters is likely 
to play a very important role in the interpretation of measurements of the 
SUSY particle masses. Essentially, after measuring these masses, it will be 
necessary to decode the results by running the effective mass parameters 
up to a higher energy at which their symmetries might become more appar- 
ent. The situation is very similar to that of the Standard Model coupling 
constants, where a renormalization group analysis told us that the appar- 
ently random values (11.137) for the coupling constants at the weak scale 
actually corresponds to a unification of couplings at a much higher scale. 

In this section, I will write the most basic RG equations for the soft 
gaugino and sfermion masses. One further effect, which involves the Yukawa 
couplings and is important for the third generation, will be discussed later 
in Section 6.1. 

The RG equation for the gaugino masses is especially simple. This 
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Fig. 11.9. Illustrative spectrum of supersymmetric particles. The columns con- 
tain, from the left, the Higgs bosons, the four neutralinos, the two charginos, the 
charged sleptons, the sneutrinos, the down squarks, and the up squarks. The 
gluino, not shown, is at about 800 GeV. 


is because both the gaugino masses and the gauge couplings arise from 
the superpotential term (11.115), with the supersymmetry breaking terms 
arising as shown in (11.159). As I have already noted, this F-term receives 
a radiative correction proportional to the @ function as a consequence of the 
trace anomaly [34,36]. The corrections are the same for the gauge boson 
field strength and the gaugino mass. Thus, if gaugino masses and couplings 
are generated at the scale M, they have the relation after RG running to 
the scale Q: 

mi(Q) _ ai(Q) 

mi(M)  ai(M) ` 
If the F term that generates the soft gaugino masses is an SU(5) singlet, 
the soft gaugino masses will be grand-unified at M. Then, running down 
to the weak scale, they will have the relation 


(11.188) 


Mı : Mm : M3 = Qı : A: ag = 05:1: 3.5. (11.189) 


This relation of soft gaugino masses is known as gaugino unification. 
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There are other models of the soft gaugino masses that also lead to 
gaugino unification. In gauge-mediated SUSY breaking, the dynamics re- 
sponsible for SUSY breaking occurs at a scale much lower than the scale as- 
sociated with mediation by supergravity. At this lower scale M, (for exam- 
ple, 1000 TeV), some heavy particles with nontrivial SU(3) x SU(2) x U(1) 
quantum numbers acquire masses from SUSY breaking. These fields then 
couple to gauginos and generate SUSY breaking masses for those particles 
through the diagram shown in Fig. 11.10(a). The heavy particles must 
fall into complete SU(5) representations; otherwise, the coupling constant 
renormalization due to these particles between M, and the grand unifica- 
tion scale would spoil the grand unification of the gauge couplings. Then 
the diagram in Fig. 11.10(a) generates soft gaugino masses proportional to 
a(M,). Running these parameters down to the weak scale, we derive the 
relation (11.189) from this rather different mechanism. 


a.) b.) 


Fig. 11.10. Diagrams that generate the soft mass parameters in gauge mediated 
supersymmetry breaking: (a.) gaugino masses; (b.) sfermion masses. 


Now let us turn to the RG running of soft scalar masses. In principle, 
there are two contributions, one from the RG rescaling of the soft mass 
term M? and one from RG evolution generating M? from the gaugino 
mass. The Feynman diagrams that contribute to the RG coefficients are 
shown in Fig. 11.11. The two one-loop diagrams proportional to M? cancel. 
The third diagram, involving the gaugino mass, gives the RG equation 


=-=S_ a(Q) Crm (Q), (11.190) 


with i = 1,2,3 and C(r;) the squared charge in the fermion representation 
ri under the gauge group i. This equation leads to a positive contribution 
to M? as one runs the RG evolution from the messenger scale down to the 
weak scale. The effect is largest for squarks, for which the SUSY breaking 
mass is induced from the gluino mass. 

As an example of this mechanism of mass generation, assume gaugino 
unification and assume that M? = 0 for all sfermions at the grand unifica- 
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Fig. 11.11. Diagrams that generate the renormalization group evolution of the 
soft sfermion mass parameters M}. 


tion scale. Then the weak scale sfermion masses will be in the ratio 


M(e) : M(é) : M(d) : MĚ) : M(d,%) : m 
= 0.5 : 0.9 : 3.09 : 3.10 : 3.24: 1 (11.191) 


This model of fermion mass generation is called no-scale SUSY breaking. It 
has the danger that the lightest stau mass eigenstate could be lighter than 
than the M, leading to problems for dark matter. This problem can be 
avoided by RG running above the GUT scale [57]. Alternatively, it might 
actually be that the lightest Standard Model superpartner is a long-lived 
stau that eventually decays to a tau and a gravitino [58,59]. 

In gauge-mediated SUSY breaking, the diagram shown in Fig. 11.10(b) 
leads to the qualitatively similar but distinguishable formula 


2 
M? =2 0 oa? (M)C2(r:) - (=) (11.192) 
t 

Each model of SUSY breaking leads to its own set of relations among 
the various soft SUSY breaking parameters. In general, the relations are 
predicted for the parameters defined at the messenger scale and must be 
evolved to the weak scale by RG running to be compared with experiment. 
Figure 11.12 shows four different sets of high-scale boundary conditions for 
the RG evolution, and the corresponding evolution to the weak scale. If we 
can measure the weak-scale values, we could try to undo the evolution and 
recognize the pattern. This will be a very interesting study for the era in 

which superparticles are observed at high energy colliders. 
There are some features common to these spectra that are general fea- 

tures of the RG evolution of soft parameters: 


(1) The pairs of sleptons @ and € can easily acquire a significant mass 
difference from RG evolution, and they might also have a different 


initial condition. It is important to measure the mass ratio m(e)/m/(e) 
as a diagnostic of the scheme of SUSY breaking. 
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Fig. 11.12. Evolution of squark and slepton masses from the messenger scale 
down to the weak scale, for four different models of supersymmetry breaking: 
(a.) universal sfermion masses at the grand unification scale My; (b.) sfermion 
masses at My that depend on the SU(5) representation; (c.) universal sfermion 
masses at an intermediate scale; (d.) gauge mediation from a sector of mass about 
1000 TeV. 


(2) Gaugino unification is a quantitative prediction of certain schemes of 
SUSY breaking. It is important to find out whether this relation is 
correct or not for the real spectrum of superparticles in Nature. 

(3) When the RG effects on the squark masses dominate the values of M? 
from the initial condition, the various species of squark have almost the 
same mass and are much heavier than the sleptons. It is important to 
check whether most or all squarks appear at the same threshold. 


11.5. The Measurement of Supersymmetry Parameters 


11.5.1. Measurements of the SUSY Spectrum at the ILC 


Now that we have discussed the physics that determines the form of the 
spectrum of superparticles, we turn to the question of how we would deter- 
mine this spectrum experimentally. This is not as easy as it might seem. In 
this section, I will consider only models in which the dark matter particle 
is the Nj, and all other SUSY particles decay to the N,. This neutral and 
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weakly interacting particle would escape a collider detector unseen. Never- 
theless, methods have been worked out not only to measure the masses of 
superparticles but also to determine mixing angles and other information 
needed to convert these masses to values of the underlying parameters of 
the MSSM Lagrangian. 

Similar methods apply to other scenarios. For example, in models in 
which the neutralino decays to a particle with gravitational interactions, 
one would add that decay, if it is visible, to the analyses that I will present. 
It is possible in models of this type that the lightest Standard Model su- 
perpartner would be a charged slepton that is stable on the time scale of 
particle physics experiments. That scenario would produce very striking 
and characteristic events [58]. 

Most likely, this experimental study of the SUSY spectrum will begin 
in the next few years with the LHC experiments. However, at a hadron 
collider like the LHC, much of the kinematic information on superparticle 
production is missing and so special tricks are needed even to measure the 
spectrum. The study of supersymmetry should be much more straightfor- 
ward at an ete" collider such as the planned International Linear Collider 
(ILC). For this reason, I would like to begin my discussion of the experi- 
ments in this section by discussing SUSY spectrum measurements at et e7 
colliders. More complete reviews of SUSY measurements at linear colliders 
can be found in [60,61]. 

I first discuss slepton pair production, beginning with the simplest pro- 


te~ — mu~ and considering successively the production of F and €. 


cess, € 
Each step will bring in new complexities and will allow new measurements 
of the SUSY parameters. 

The process ete~ — Mm, where jz is the partner of either the left- 
or right-handed u, can be analyzed with the simple formulae for scalar 
particle-antiparticle production. The cross section for pair production from 
polarized initial electrons and positrons to final-state scalars with definite 


SU(2) x U(1) quantum numbers is given by 


da TO? 4 
yg a sin? 0 | fabl? , (11.193) 


where 


(I2 + 82) (I3 +82) s 


Co Sw 


fa = 14 5 (11.194) 


s— mz 
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and, in this expression, I’ = —3,0 for a,b = L,R. For the initial state, 
a = L denotes the state ez ep and a = R denotes eRe. For the final state, 
b = L denotes the 4, b = R the J: Notice that this cross section depends 
strongly on the polarization states: 


lfa? =1.69 epef op E 
= 0.42 ezek” mu 
=0.42 ege, >H U 
=1.98 ezek oma (11.195) 


The angular distribution is characteristic of pair-production of a spin 0 
particle; the normalization of the cross sections picks out the the correct 
set of SU (2) x U(1) quantum numbers. 

If the smuon is light, its only kinematically allowed decay might be 
bm uN? . Even if the smuon is heavy, if the Ñ, is mainly gaugino, this 
decay should be important. As noted above, I am assuming that R-parity 
is conserved and that the N is the lightest particle in the superparticle 
spectrum. Then events with this decay on both sides will appear as 


ete — utu“ + (missing E and p) (11.196) 
The spectrum of the observed muons is very simple. Since the jz has spin 
0, it decays isotropically in its own rest frame. In ete~ production at a 
definite center of mass energy, the ju is produced at a definite energy, and 
thus with a definite boost, in the lab. The boost of an isotropic distribution 
is a flat distribution in energy. So, the muon energy distribution should be 
flat, between endpoints determined by kinematics, as shown in the idealized 
Fig. 11.18. 


E(u) 


Fig. 11.13. Schematic energy distribution of final-state muons in ete” — fit fi. 
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Fig. 11.14. Energy distribution of muons from ete~ > ba at the ILC, in a 
simulation by Blair and Martyn that includes realistic momentum resolution and 
beam effects [63]. 


The endpoint positions are simple functions of the mass of the ñ% and 
the mass of the Nj, 
m? (ji) — m?(Ni) 
ami) 
where y = Eom/2m(fi), 3 = (1—4m?(fi)/E2y,)*/?. If we can identify both 
endpoint positions, we can solve for the two unknown masses. Figure 11.14 


Ex, = 7(1+8) (11.197) 


shows a simulation of the reconstructed smuon energy distribution from 7 
pair production at the ILC [63]. The high-energy edges of the distributions 
are rounded because of initial-state radiation in the ete~ collision. The 
experimenters expect to be able to measure this effect and correct for it. 
Then they should obtain values of the smuon mass to an accuracy of about 
one hundred MeV, or one part per mil. 

A similar analysis applies to e'e~ — T+7—, but there are several com- 
plications. First, for the 7 system, mixing between the 7 and the F might 
be important, especially if tan 8 is large. The production cross sections 
are affected directly by the mixing. For example, to compute the pair- 
production of the lighter 7 mass eigenstate from a polarized initial state, 
eRe} > 7, 7, we must generalize (11.193) to 


do TQ 


2 
3 gin? 2 
PT 25 B? sinf O| fral , (11.198) 
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where 
fri = frrcos” 6, + frisin? 6, (11.199) 


and 6, is the mixing angle associated with the diagonalization of the T case 
of (11.176). 

Second, while the 7 can decay to Tab through gauge couplings, this 
weak eigenstate can also decay to Tr ha through terms proportional to the 
Yukawa coupling. Both decay amplitudes contribute to the observable de- 
cay T1 > rN? . With the 7 mixing angle fixed from the measurement 
of the cross section, the 7 polarization in 7 decays can be used to deter- 
mine the mixing angles in the diagonalization of the neutralino mass matrix 
(11.183) [62]. 
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Fig. 11.15. Energy distribution of the three-pion system from ete” — 7, Tj at 


the ILC, with a 7 decay to 37, in a simulation by Blair and Martyn that includes 
realistic momentum resolution and beam effects. [63]. 


In Fig. 11.15, I show the distribution of total visible energy in T —> 
3m + y+ No at the ILC. Though there is no longer a sharp feature at the 
kinematic endpoint, it is still possible to accurately determine the 7 mass 
by fitting the shape of this distribution. 

The physics of ete~ — é€*é~ brings in further new features. In this 
case, there is a new Feyman diagram, involving ¢t-channel neutralino ex- 
change. The two diagrams contributing to the cross section for this process 
are shown in Fig. 11.16. The t-channel diagram turns out to be the more 
important one, dominating the s-channel gauge boson exchange and gen- 
erating a large forward peak in selectron production. The cross section for 
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ene, >E é is given by another generalization of (11.193), 


do Ta 3. 
Toos = zs Pit OlFael’ , (11.200) 
where 
Frr= f bP Vu e (11.201) 
ve ae Cw m? —t , i 


i 
with the sum running over neutralino mass eigenstates. The factor Vor; is 
a matrix element of the unitary matrix introduced in (11.184). 


e et 


Fig. 11.16. Feynman diagrams contributing to e'e~ — & èT. 


~+ 


The t-channel diagram also allows new processes such as eze; >€ €. 
Note the correlation of the initial-state electron and position spins with the 


identities of the final-state selectrons. A complete set of polarized cross 


sections for selectron pair production in e*e~ and e~e™ collisions can be 
found in [64]. 


The cross sections for chargino and neutralino pair production in et e~ 


e 
collisions are somewhat more complicated, but still there are interesting 
things to say about these processes. Chargino pair production is given by 
the Feynman diagrams shown in Fig. 11.17. These diagrams are just the 
supersymmetric analogues of the diagrams for ete — WtW. As in that 
process, the most charcteristic final states are those with a hadronic decay 
on one side of the event and a leptonic decay on the other side, for example, 


Ct + ttyN®?, Cl > dun? . (11.202) 
A typical event of this kind is shown in Fig. 11.18. 


The chargino and neutralino production cross sections have a strong de- 
pendence on the mixing angles in (11.184) and (11.186) and offer a number 
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Fig. 11.17. Feynman diagrams contributing to ete” — æ 7. 


Fig. 11.18. A simulated chargino pair production event at the ILC [65]. 


of strategies for the determination of these mixing angles. Let me present 
one such strategy here. Consider the reaction from a polarized initial state 
eRe} > Cr Gt . Since we have an initial ep, the ¢-channel diagram vanishes 
because the right-handed electron does not couple to the neutrino. Now 
simplify the s-channel diagram by considering the limit of high energies, 
s > mz. In this limit, it is a good approximation to work with weak gauge 
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eigenstates (B°, W?) rather than the mass eigenstates (y,Z°). The weak 
eigenstate basis gives a nice simplification. The initial ep couples only to 
B°. But w+ couple only to W®, so at high energy the s-channel diagram 
gets contributions only from the Higgsino components of the on and Čt 
eigenstates. If we go to still higher energies, s > m(Ĝ1)?, there is a further 
simplification. The cross section for hght production is forward-peaked, 
and the cross section for hih production is backward-peaked. Then, the 
cross section for ene? —> CI Cf takes the form 


2 

n 3 a2; lala + cos 6)? + |V21|2(1 — cos6)?] . (11.203) 
In this limit, it is clear that we can read off both of the mixing angles in 
(11.186) from the shape of this cross section. 

The use of high-energy limits simplified this analysis, but the sentivity 
of this cross section to the chargino mixing angles is not limited to high 
energy. Even relatively close to threshold, the polarized cross sections for 
chargino production depend strongly on the chargino mixing angles and can 
be used to determine their values. In Fig. 11.19, I show contours of constant 
cross section for epe} > Cr Ct in the (m2, u) plane (for tan 8 = 4 and 
assuming gaugino unification) [66]. The value of this cross section is always 
a good measure of whether the SUSY parameters in Nature put us in the 
gaugino or the Higgsino region of Fig. 11.8. 


11.5.2. Observation of SUSY at the LHC 


Now we turn to supersymmetry production processes at the LHC. This 
subject, though more difficult, has immediate importance, since the LHC 
experiments are just about to begin. 

The reactions that produce superparticles are typically much more com- 
plicated at hadron colliders than at lepton colliders. This is true for several 
reasons. High energy collisions of hadrons are intrinsically more compli- 
cated because the final states include the fragments of the initial hadrons 
that do not participate in the hard reaction. More importantly, the dom- 
inant reactions at hadron colliders are those that involve strongly inter- 
acting superparticles. This means that the primary particles are typically 
the heavier ones in the spectrum, which then decay in several steps. In 
addition, large backgrounds from QCD obscure the signatures of super- 
symmetric particle production in many channels. 

Because of these difficulties, there is some question whether SUSY par- 
ticle production can be observed at the LHC. However, as I will explain, 


Supersymmetry in Elementary Particle Physics 671 


600 


TEA 
SS 
Se] 
setetees 
2 
sete 
SoS 
Se 
SE 
Bes 
555 
Be 
Z 
3 Soe 
se 
oe 


8 


s 
5 


5 


a L 
SOT 

SSN IN ASAA 
6) k SS 


-600 -300 0 300 600 
u (GeV) 


Fig. 11.19. Contours of constant cross section for the process epe} — Cy CÌ (in 
fb, for Ecm = 500 GeV), as a function of the underlying SUSY parameters [66]. 
The region shown is that in which the lightest chargino mass varies from 50 to 
200 GeV. For fixed C] mass, the cross section increases from zero to about 150 
fb as we move from the gaugino region into the Higgsino region. 


the signatures of supersymmetry are still expected to be striking and char- 
acteristic. It is not so clear, though, to what extent it is possible to measure 
the parameters of the SUSY Lagrangian, as I have described can be done 
from ILC experiments. This is an important study that still offers much 
room for new ideas. 

The discovery of SUSY particles at the LHC and the measurement of 
SUSY parameters has been analyzed with simulations at a number of pa- 
rameter points. Collections of interesting studies can be found in [63,67,68]. 

The dominant SUSY production processes at the LHC are 


99-99, ga-9¢ (11.204) 


These cross sections are large—tens of pb in typical cases. The values 
of numerous SUSY production cross sections at the LHC are shown in 
Fig. 11.20 [70]. 

We have seen that the squarks and gluinos are typically the heaviest 
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Fig. 11.20. Cross sections for the pair-production of supersymmetric particles at 
the LHC, from [70]. 


particles in the supersymmetry spectrum. The gluinos and squarks thus 
will decay to lighter superparticles. Some of these decays are simple, e.g., 


q—>qgN?. (11.205) 
However, other decays can lead to complex decay chains such as 
G—> qN? > g(éte-)N?, = G > udCt —> udWt N? . (11.206) 


With the assumptions that R-parity is conserved and that the N? is 
the LSP, all SUSY decay chains must end with the NP, which is stable 
and very weakly interacting. SUSY production processes at hadron collid- 
ers then have unbalanced visible momentum, accompanied by multiple jets 
and, possibility, isolated leptons or W and Z bosons. Momentum balance 
along the beam direction cannot be checked at hadron colliders, because 
fragments of the initial hadrons exit along the beam directions, but an im- 
balance of transverse momentum will be visible and can be a characteristic 
signature of new physics. SUSY events contain this signature and the gen- 
eral large activity characteristic of heavy particle production. A simulated 
event of this type is shown in Fig. 11.21. 
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Fig. 11.21. Simulated SUSY particle production event in the CMS detector at 
the LHC [69]. 


Figure 11.22 shows a set of estimates given by Tovey and the ATLAS 
collaboration of the discovery potential for SUSY as a function of the LHC 
luminosity [71]. The most important backgrounds come from processes that 
are themselves relatively rare Standard Model reactions with heavy particle 
production, 


pp > (W, Z, tt) + jets . (11.207) 


With some effort, we can experimentally normalize and control these back- 
grounds and reliably discovery SUSY production as a new physics process. 
In the figure, the contours for 50 excesses of events above these backgrounds 
for various signatures of SUSY events are plotted as a function of the so- 
called ‘mSUGRA’ parameters. The SUSY models considered are defined as 
follows: Assume gaugino unification with a universal gaugino mass mı /2 at 
the grand unification scale. Assume also that all scalar masses, including 
the Higgs boson mass parameters, are unified at the grand unification scale 
at the value mp. Assume that the A parameter is universal at the grand 
unification scale; in the figures, the value A = 0 is used. Fix the value 
of tan @ at the weak scale. Then it is possible to solve for u and B, up 
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to a sign, from the condition that electroweak symmetry is broken in such 
a way as to give the observed value of the Z° mass. (I will describe this 
calculation in Section 6.1.) This gives a 4-parameter subspace of the full 
24-dimensional parameter space of the CP- and flavor-conserving MSSM, 
with the parameters 


mo, M2, A, tanp , sign(p) . (11.208) 


This subspace is often used to express the results of phenomenological anal- 
yses of supersymmetry. In interpreting such results, one should remember 
that this choice of parameters is used for simplicity rather than being mo- 
tivated by physics. 

The figure shows contours below which the various signatures of su- 
persymmetry significantly modify the Standard Model expectations. For 
clarity, the contours of constant squark and gluino mass are also plotted. 
The left-hand plot shows Tovey’s results for the missing transverse momen- 
tum plus multijets signature at various levels of LHC integrated luminosity. 
It is remarkable that, in the models in which the squark or gluino mass is 
below 1 TeV, SUSY should be discoverable with a data sample equivalent 
to a small fraction of a year of running. The right-hand plot shows the 
contours for the discovery of a variety of SUSY signals, with up to three 
leptons plus jets plus missing transverse momentum, with roughly one year 
of data at the initial design luminosity. The signals are, as I have described, 
relatively robust with repect to uncertainties in the Standard Model back- 
grounds. This makes it very likely that, if SUSY is really present in Nature 
as the explanation of electroweak symmetry breaking, we will discover it at 
the LHC. 

The general characteristics of SUSY events also allow us to estimate the 
SUSY mass scale in a relatively straightforward way. In Fig. 11.23, I show 
a correlation pointed out by Hinchliffe and collaborators [72] between the 
lighter of the squark and gluino masses and the variable 


4 
Mer 5 =Er + 5 Er; (11.209) 
ak 


given by the sum of the transverse momenta of the four highest Er jets to- 
gether with the value of the missing transverse momentum. The correlation 
applies reasonably well to mSUGRA models. In other models with smaller 
mass gaps between the squarks and the lightest neutralino, this relation 
can break down, but Mepps still measures the mass difference between the 
squark or gluino and the N ? [73]. Some more sophisticated techniques for 
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Fig. 11.22. Estimates by the ATLAS collaboration of the observability of various 
signatures of SUSY at the LHC. The plots refer to models with grand unification 
and universal sfermion and gaugino masses Mo and Mj/;2. The left-hand plot 
shows the region of this parameter space in which it is possible to detect the 
signature of missing Er plus multiple jets at various levels of integrated luminos- 
ity. The right-hand plot shows the region of this parameter space in which it is 
possible to detect an excess of events with one or more leptons in addition to jets 
and missing Er [71]. 


determining mass scales in SUSY models from global kinematic variables 
are described in [74]. 


11.5.3. Measurements of the SUSY Spectrum at the LHC 


So far, I have only discussed the observation of the qualitative features of 
the SUSY model from global measures of the properties of events. Now I 
would like to give some examples of analyses in which specific details of the 
SUSY spectrum are measured with precision at the LHC. The examples 
that I will discuss involve the decay chain 


G—qNS , N9 > NOE, (11.210) 


which is typically seen in models in which the gluino is heavier than the 
squarks and the LSP is gaugino-like. 
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Fig. 11.23. Correlation between the value of the observable (11.209) and the 
lighter of the squark and gluino masses, from [72]. 


The decay of the N? can proceed by any of the mechanisms: 


N? > E+E, aE 
N? = N? Z? ate 
N? NOR , Z™ peg, (11.211) 


The last line indicates a virtual Z°, decaying off-shell. In a model with 
gaugino unification and heavy Higgsinos, N is mainly w° and N; is mainly 
D. Then these modes are preferred in the order listed as long as they are 
kinematically allowed. If the slepton decay is allowed, this is the domi- 
nant model. Otherwise, the decay to N 1Z° or other open two-body decays 
dominate. If no two-body decays are open, the No must decay through 
three-body processes such as the last line of (11.211). 

The decay to an on-shell Z° is hard to work with [75], but the other 
two cases can be explored in depth. It is useful to begin with the Dalitz 
plot associated with the 3-body (Ny, £+, £7) system. Let 


2E(N;) 2E (e+) 2E(£-) 


zo = , t= —— go —— , (11.212) 
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Fig. 11.24. The Dalitz plot describing 3-body neutralino decays, NS => N. 


where the energies are measured in the rest frame of the Nə. The three 
variables are related by 


£o + zı +2 =2. (11.213) 
The three-body decay phase space is given by 
m? (Na) 
T = = 11.214 
Jon Y f desde- ; (11.214) 


that is, phase space is flat in the variables (11.212). The basic kinematic 

identities involving the Dalitz plot variables are straightforward to work 

out, especially if we ignore the masses of the leptons. The kinematically 
allowed region is a wedge of the (a;,2_) plane bounded by the curves 

ny +e = 1- (m(Ni)/m(M2))? 
(1—24)(1—2_) = (m(M)/m(N2))? , (11.215) 
as shown in Fig. 11.24(a). The invariant masses of two-body combinations 
are given in terms of the x, by 
m?(Nı L) 
m?(No) 


m? (LTE) 
= (1 — x+), =— = (1 
i l m? (Nə) i m(N2)? 


jE (11.216) 
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Tam assuming that the Ñ; is stable and weakly interacting. In this case, 
the Ñ, will not be observed in the LHC experiments, and also the frame of 
the Ñ cannot be readily determined. The only property of this system that 
is straightforward to measure is the two-body invariant mass m(£t£~). So 
it is interesting to note that the distribution of this quantity distinguishes 
the first and third cases in (11.211), in the manner shown in Fig. 11.24(b). 
In the case of a two-body decay to an intermediate slepton, the decays 
populate two lines on the Dalitz plot, leading to a sharp discontinuity at 
the kinematic endpoint. In the case of a three-body decay, the events 
fill the whole Dalitz plot, producing a distribution with a slope at the 
endpoint. With a good understanding of the detector resolution in the 
dilepton invariant mass, these cases can be distinguished experimentally. 

In the three-body case, the endpoint of the dilepton mass distribution 
is exactly 


m(N) — m(N) , (11.217) 


so the observable mass distribution gives a precise measurement of this 
SUSY mass difference. The shape of the spectrum has more information. 
For example, for heavy slepton masses, the shape is distinctly different 
for gaugino-like or Higgsino-like neutralinos. Figure 11.25(a) shows the 
dilepton mass distribution for an mSUGRA parameter set for which the 
lightest two neutralinos are gaugino-like [72]. Figure 11.25(b) shows this 
distribution for a parameter set in which the two lightest neutralinos are 
Higgsino-like [73]. 

At the endpoint, the dilepton mass is maximal, and this requires that 
both the dilepton pair and the Nj are at rest in the frame of the No. By 
measuring the four-vectors of the leptons, we would then know the N; and 
Nə four-vectors, up to knowledge of the Nı mass. It is possible to obtain 
this mass approximately from other measurements, for example, from the 
kinematics of 7 decays directly to Ni. With this information, we could 
determine the N2 four-vector. Now the problem of missing momentum is 
solved. By adding observed jets to the Nə four-vector, it is possible to find 
squarks as resonances [72]. Figure 11.26 shows the result of such an analysis 
for the SUSY parameter set of Fig. 11.25. The peak just below 300 GeV is 
a reconstructed b squark. 

The two-body case of No decay is even nicer. In this case, we can see 
from the right-hand figure in Fig. 11.24(b) that the endpoint of the dilepton 
mass distribution is not located at the mass difference (11.217) but instead 
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Fig. 11.25. Distribution of the dilepton invariant mass in two supersymmetry 
models with 3-body neutralino decays: (a.) a model with gaugino-like neutrali- 
nos [72], (b.) a model with Higgsino-like neutralinos [73]. In the second figure, 
the dashed curve indicates the m(£*é~) spectrum expected for gaugino-like neu- 
tralinos with the same mass splitting. 
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Fig. 11.26. Reconstruction of a squark in the model of Fig. 11.25(a) by combining 


a dilepton pair at the endpoint of the m(€*é~) distribution, the N? in the same 
frame with mass determined from kinematics, and a b-tagged quark jet. 


Events/1 GeV/100 fb” 


N 
[=] 
i=] 


800 
600 


400 


20 


800F ii 
t yi 
600- 4 | i 
[ ity 
[ ath i 
400- if 
i t 
4h { 
F i t 
200- ttt 
ah 
“4060 80 100 % 20. 40 60 80 ~ 100 
m, (GeV) m, (GeV) 


Fig. 11.27. Dilepton mass distribution in a model with two-body No decays, 
from [63]. The left-hand plot shows the dilepton mass distributions for opposite- 
sign same-flavor dileptons (solid) and for opposite-sign opposite-flavor dileptons 
(dashed). The lower histograms give the estimates of the Standard Model back- 
ground. The right-hand plot shows the difference of the two distributions. 
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at the smaller value 
~ 2(0) 2(N. 
m(lte-) = mip m i) al (11.218) 


Figure 11.27 shows an example of the dilepton spectrum from a SUSY 
parameter point in this region [63] The decay g — qNo is also a two- 
body decay, and there are similar kinematic relations for the upper and 
lower endpoints of the (q£) and (qé@) invariant mass distributions. These 
endpoints are likely to be visible in the collider data. Figure 11.28 shows 
two jet-lepton mass distributions from a similar analysis presented in [76]. 
In that analysis, it was possible to identify five well-measured kinematic 
endpoints, from which it was possible to solve (in an overdetermined way) 
for the four masses m(N1), m(@), m(N2), mQ). 

There is one more case of an No => M decay that should be mentioned. 
If two-body decays of Na to sleptons are not kinematically allowed but 
the decay to N,h° is permitted, this decay to a Higgs boson will be the 
dominant No decay. In this case, supersymmetry can provide a copious 
source of Higgs bosons. Figure 11.29 shows an analysis of a SUSY model 
in this parameter region [67]. Events with multijets and missing transverse 
energy are selected. In this sample, the mass distribution of two b-quark- 
tagged jets is shown. The signature of SUSY selects a sample of events in 
which the Higgs boson is visible in its dominant decay to bb. 
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Fig. 11.29. The dijet mass distribution for 2 b-tagged jets at a point in the SUSY 
parameter space where the decay N9 — h° NY} is dominant, from [67]. 


There is much more to say about the measurement of SUSY parameters 
at the LHC. Some more sophisticated sets of variables are introduced and 
applied in [76,77]. The question of measuring the spins of superparticles is 
discussed in [78-81]. And, we have not touched on alternative possibilities 
for the realization of SUSY, with R-parity violation or charged superpar- 
ticles that are observed in the LHC experiments as stable particles. A 
broader overview of SUSY phenomenology at the LHC can be found in the 
references cited at the beginning of this section. 


11.6. Electroweak Symmetry Breaking and Dark Mat- 
ter in the MSSM 


11.6.1. Electroweak Symmetry Breaking in the MSSM 


In Section 1.2, I motivated the introduction of SUSY with the claim that 
SUSY could give an explanation of electroweak symmetry breaking, and for 
the presence of weakly interacting dark matter in the universe. Now that 
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M2 


Fig. 11.30. Diagram contributing a term to the renormalization group equation 
for the soft mass parameter of t proportional to the soft mass parameter for Ha. 


we have a detailed understanding of the structure of the MSSM, it is time 
to come back and discuss these issues. 

To present the mechanism of electroweak symmetry breaking in the 
MSSM, I need to add a term to one of the equations that I derived in 
Section 4.3. In (11.190), I presented the RG equation for the soft SUSY 
breaking scalar mass parameters, including renormalization effects from 
gauge interactions. I remarked that the contributions to this equation from 
Higgs Yukawa couplings are small for the scalars of the first and second gen- 
erations. However, for the scalars of the third generation, these corrections 
can plan an important role. 

The F-term interaction 


L=-|yH- i (11.219) 


leads to a contribution to the RG equations for M4, the mass parameter of 
t, proportional to M,,,, from the diagram shown in Fig. 11.30. The value 
of the diagram is 


iy? d'k i M2.) to aaye A? 
Mf (any ga Mira) 5 = (ama "t Mita 108 (11.220) 


A scalar self-energy diagram is interpreted as —idm?, so this is a negative 
contribution to M?. Each of the scalar fields (H,,,t,#) gives a similar con- 
tribution that renomalizes the soft mass parameter of each of the others. 
For each correction, there is a counting factor from the number of color or 
SU(2) degrees of freedom that run around the loop. There is also a correc- 
tion to each of the scalar masses from the top quark A term. We must also 
remember that all of these terms add to the positive mass correction from 
the gaugino loops in Fig. 11.11, of which the gluino loop correction is the 
most important. 
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Taking all of these effects into account, we find for the RG equations of 
the soft mass parameters of H,,, t, and t 


dM? 2 2 2 2 2 2 8 2 
= - 1- y2 [M? + M? + M2, + 4?] - — + 

dlogQ (47)? yl t E Hu i] 3g 033 
dM2 2 

—t_ = —— 2. y? |M? + M? + Mz, +A? A 
dlogQ (47)? yl t E Hu i] Be 033 
dM? 2 

Hu — —_ .3- y?[M? + M? + Mj, tAm +- (11.221) 


dlogQ (4r)? 


The structure is very interesting. The three scalar fields H,, t, and Ẹ 
all receive negative corrections to their mass terms as these equations are 
integrated in the direction of decreasing log Q. If any of these mass terms 
were to become negative, the corresponding field would have an instability 
to develop a vacuum expectation value, and the symmetry of the MSSM 
would be spontaneously broken. The symmetry-breaking we want is that 
associated with (H,,) 4 0. However, it seems equally possible that we could 
generate (t) Æ 0, which would break color SU(3), or (t) 4 0, which would 
break both SU(2) and SU(3). 

If the three mass parameters have similar values at a high mass scale, 
they race toward negative values according to (11.221). But H, wins the 
race, and so the theory predicts the symmetry breaking pattern that is the 
one observed. In this way, the MSSM leads naturally to electroweak sym- 
metry breaking and realizes the idea that electroweak symmetry breaking 
is connected to the large value of the top quark-Higgs coupling. 


11.6.2. Higgs Boson Masses in the MSSM 


Once we expect that M? < 0 at the weak scale, we can work out the details 
of the Higgs boson spectrum. First, we should write the potential for the 
Higgs fields H,,, Ha. As in the discussion of Sections 4.1 and 4.2, a number 
of terms need to be collected from the various pieces of the Lagrangian. 
The F terms contriubute 
Vr = u?’ (H9* H? + HoH?) (11.222) 
The D terms contribute 
De 2 

Vp = PT ns He — H” H9)? (11.223) 

The soft SUSY breaking terms contribute 


Vzoft = Mj, Ho" H? + M? H9 HY — (BuHL HG + h.c.) (11.224) 
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The sum of these terms gives the complete tree-level Higgs potential. Differ- 
entiating this potential with respect to H} and H9, we obtain the equations 
that determine the Higgs field vacuum expectation values. If we write these 
equations with the parametrization of the vacuum expectation values given 
in (11.169), we find 


1 
pw? + M2, = Bucot B+ az cos 26 


1 
KR? + Mz = Butan Bb — az cos 28 , (11.225) 


where mz = (g? + g’*)v?/4. This system of equations can be solved for yu 

to give 

2_ M?,, — tan? BMj,,, 
tan? 8—1 


1 
u Zz (11.226) 
This is, for example, the way that we would determine u in the mSUGRA 
parameter space described in Section 5.2. 
It is interesting to turn this equation around and write it as an equation 
for mz in terms of the SUSY parameters, 


2 _ o Ma- tan? 6M}, 


= Qu? . 11.227 
mz tan? B Sa H ( ) 


From this equation, a small value of mz would require a cancellation be- 
tween the Higgs soft mass parameters and u. The parameter u sets the 
mass scale of the Higgsinos, and the Higgs soft mass parameters might be 
related to other masses of the SUSY scalar particles. Thus, if the masses of 
the charginos and neutralinos and, perhaps also, the sleptons are not close 
to mz, that disparity must be associated with an apparently unnatural 
cancellation between different SUSY parameters. 

If we prohibit a delicate cancellation in (11.227), we put an upper bound 
on the SUSY partner masses. To avoid cancellations in more than two dec- 
imal places, u must be less than 700 GeV. Similarly, we find bounds on the 
Higgs soft masses, and on the parameters that contribute to these masses 
through the RG equation. This consideration turns out to give a constraint 
on the gluino mass, m3 < 800 GeV. Assuming gaugino universality, this 
becomes a condition mz < 250 GeV that restricts the chargino and neu- 
tralino masses. A variety of similar naturalness arguments that constrain 
the SUSY scale can be found in [82-84]. Though the logic is that of an 
estimate rather than a rigorous bound, this analysis strongly supports the 
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idea that SUSY partners should be light enough to be discovered at the 
LHC and at the ILC. 

Once we have the Higgs potential and the conditions for the Higgs vac- 
uum expectation values, we can work out the masses of the Higgs bosons 
by expanding the potential around its minimum. A first step is to identify 
the combinations of Higgs fields that correspond to physical Higgs bosons. 
Look first at the charged Higgs bosons. There are two charged Higgs fields 
in the multiplets H,,, Ha. One linear combination of these fields is the Gold- 
stone boson that is eaten by the W boson as it obtains mass through the 
Higgs mechanism. The orthogonal linear combination is a physical charged 
scalar field. If we decompose 


H+ = cos BH* + sin GGT 
Hı = sin 6H + sin pG (11.228) 


where H~ = (Ht)*, G- = (G*)*, and @ is precisely the mixing angle in 
(11.169), it can be seen that G* are the Goldstone bosons and H= are the 
physical scalar states. 

A similar analysis applies to the neutral components of H} and H°. 
These are complex-valued fields. It is appropriate to decomposed them as 


1 
H? = RA + sin aH? + cos ah? + i cos BA? + isin 8G?) 
1 
H} = e cosa H? — sinah? + isin 8A? — icos 86G?) (11.229) 


The components H®, h? are even under CP; the fields 4°, G? are odd under 
CP. The componet G® is the Goldstone boson eaten by the Z°. The other 
three fields create physical scalar particles. 

Having identified these fields, we can compute their masses. The for- 
mulae for the Higgs masses take an especially simple form when they are 
expressed in terms of the mass of the A°. For the charged Higgs boson 


mz, =m, + miy . (11.230) 
For the CP-even scalars, one finds a mass matrix 


( m? sin? 8 + m2, cos? 8 —(m?, + m2) sin 8 cos J 


11.231 
—(m?, + m2.) sin 6 cos 6 m? cos? B + m3 sin? 8 ( ) 


The physical scalar masses m? and mî; are the eigenvalues of this matrix, 
defined in such a way that m? < m2,. The angle a in (11.229) is the mixing 
angle that defines these eigenstates. 
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Taking the trace of (11.231), we find the relation 
m +m =m tm}. (11.232) 


We can also obtain an upper bound on the lighter Higgs mass m7? by taking 
the matrix element of (11.231) in the state (cos 8, sin 3). The bound is a 
very strong one: 


m? < mz cos? B < m3. (11.233) 


This seems inconsistent with lower bounds on the Higgs boson mass from 
LEP 2, which exclude mp < 114 GeV for the Standard Model Higgs and for 
most scenarios of SUSY Higgs bosons [85].1 However, the one-loop correc- 
tions to the tree-level result (11.231) give a significant positive correction 


3 4 MN 
õm? = BLE log 4 . (11.234) 
T My mọ 


This correction can move the mass of the h? up to about 130 GeV. The 
detailed summary of the radiative corrections to the h? mass in the MSSM 
is presented in [88]. A very clear and useful accounting of the major cor- 
rections can be found in [89]. 

It is possible to raise the mass of the h? by going outside the MSSM 
and adding additional SU(2) singlet superfields to the model. However, this 
strategy is limited by a general constraint coming from grand unification. 
The requirement that the Higgs couplings do not become strong up to the 
grand unification scale limit the mass of the Higgs to about 200 GeV [90]. 
It is possible to raise the mass of the Higgs further only by enlarging the 
Standard Model gauge group or adding new thresholds that affect unifica- 
tion [91,92]. 

In the MSSM, we can easily have the situation in which m4 > mp. In 
this limit, the couplings of the h? are very close to those of the Standard 
Model Higgs boson, and the H®, A°, and HË are almost degenerate. If 


tan 3 > 1, the heavy neutral Higgs bosons decay dominantly to bb and 
+ 


Ye g 
Much more about the phenomenology of Higgs bosons in supersymmetry 


can be found in [93,94]. 


11.6.3. WIMP Model of Dark Matter 


Now we turn to the second problem highlighted in the Introduction, the 
problem of dark matter in the universe. It has been known from many 


dSome exceptional Higgs decay schemes that escape these bounds are considered 
in [86,87]. 
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astrophysical measurements that the universe contains enormous amounts 
of invisible, weakly interacting matter. For an excellent review of the classic 
astrophysical evidence for this dark matter, see [95]. 

In the past few years, measurements of the cosmic microwave back- 
ground have given a new source of evidence for dark matter. Since this 
data comes from an era in the early universe before the formation of any 
structure, it argues strongly that the invisible matter is not made of rocks or 
brown dwarfs but is actually a new, very weakly interacting form of matter. 
These measurements also determine quite accurately the overall amount of 
conventional and dark matter in the universe. Let py, py, and pa be the 
large-scale energy densities of the universe from baryons, dark matter, and 
the energy of the vacuum. The data from the microwave background tells 
us that pp + px + Pa = fc, the ‘closure density’ corresponding in general 
relativity to a flat universe, to about 1% accuracy. If Q; = p;/pc, the most 
recent data from the WMAP experiment and other sources gives [96,97] 


Qs = 0.042 + 0.003 Qy =0.2040.02 OQ, =0.74+0.02. (11.235) 


These results present a double mystery. We do not know what particle the 
dark matter is made of, and we do not have any theory that explains the 
observed magnitude of the vacuum energy or ‘dark energy’. 

I believe that supersymmetry will eventually play an essential role in 
solving the problem of dark energy. In ordinary quantum field theory, 
the value of the vacuum energy is quartically divergent, so the problem of 
computing the vacuum energy is not even well-posed. In supersymmetry, 
there is at least a well-defined zero of the energy associated with exact 
supersymmetry, which implies (0| H |0} = 0. Unfortunately, in most of 


today’s models of supersymmetry, the vacuum energy is set by the SUSY 
breaking scale. This gives A ~ (10!! GeV)“, about 80 orders of magnitude 
larger than the observed value of the vacuum energy. From this starting 
point, A must be fine-tuned to the scale of eV*. This is an important 
problem that needs new insights which, however, I will not provide here. 
On the other hand, supersymmetry offers a very definite solution to the 
problem of the origin of dark matter. We have already noted in Section 3.4 
that it is straightforward to arrange that the lightest supersymmetric par- 
ticle can be absolutely stable. If this particle were produced in the early 
universe, some density of this type of matter should still be present. In 
most, but not all, regions of parameter space, the lightest supersymmetric 
particle is neutral. Candidates include the lightest neutralino, the light- 
est sneutrino, and the gravitino. In the remainder of these lectures, I will 
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concentrate on the case in which the lightest neutralino is the dark matter 
particle. For a discussion of the other candidates, see [98]. 

To begin our discussion, I would like to estimate the cosmic density of 
dark matter in a more general context. Let me make the following minimal 
assumptions about the nature of dark matter, that the dark matter particle 
is stable, neutral, and weakly interacting. To these properties, I would like 
to add one more, that dark matter particles can be created in pairs at 
sufficiently high temperature, and that, at some time in the early universe, 
dark matter particles were in thermal equilibrium. I will refer to a particle 
satisfying these assumptions as a ‘weakly interacting massive particle’ or 
WIMP. The assumption of thermal equilibrium is a strong one that is not 
satisfied even in many models of supersymmetric dark matter. For some 
exceptions, see [99,100]. However, let us see what implications follow from 
these assumptions. 

The assumption that WIMPs were once in thermal equilibrium provides 
a definite initial condition from which to compute the current density of 
dark matter. In thermal equilibrium at temperture T, we have for the 
number density of dark matter particles 

g 


Sunn FS 3/2 ,—-m/T 
Neg = Or) (mT)” e (11.236) 


where g is the number of spin degrees of freedom of the massive particle. 
As the universe expands, the temperature of the universe deccreases and 
the rate of WIMP pair production becomes very small. But the rate of 
dark matter pair annihilation also becomes small as the WIMPs separate 
from one another. 

The expansion of the universe is governed by the Hubble constant H = 
a/a,where a is the scale factor. Einstein’s equations imply that 


ae (11.237) 


In a radiation-dominated universe where g, is the number of relativistic 
degrees of freedom, p = 77g,T+/30. Then H is proportional to T?. In 
a radiation-dominated universe, the temperature red-shifts as the universe 
expands, so that T ~ a~!. Combining this relation with the equation 
H = åja ~ T?, we find t ~ T7? ~ a?, that is, a ~ t!/? or a/a = 1/2t. 
Setting this expression equal to the explict form of H in (11.237), we find 
a detailed formula for the time since the start of the radiation-dominated 
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era for cooling to a temperature T, 


167° 9, “ae Mp} 
= ( n ) A, (11.238) 


The evolution of the WIMP density is described by the Boltzmann 

equation 

dn 5 5 

a —3Hn — (ov) (n° — nég) , (11.239) 
where H is the Hubble constant, ø is the NN annihilation cross section— 
which appears thermally averaged with the relative velocity of colliding 
WIMPs—and neg is the equilibrium WIMP density (11.236). Assume, just 
for the sake of argument, that the temperature T is of the order of 100 GeV. 
At this temperature, the Hubble constant has the magnitude H ~ 1071!’T, 
so the expansion of the universe is very slow on the scale of typical ele- 
mentary particle reactions. However, when T becomes less than the WIMP 
mass m, the WIMP density is exponentially suppressed and so the collision 
term in the Boltzmann equation is also very small. These two terms are of 
the same size at the freezeout temperature Tp satisfying 


e-miTe 1 (11.240) 
mpm (ov) 

At temperatures below Tr, we may neglect the production of WIMPs in 
particle collisions. The WIMP density is then determined by the expansion 
of the universe and the residual rate of WIMP pair annihilation. Maybe it 
is more appropriate to think of Tp as the temperature at which a WIMP 
density is frozen in. To determine the freezeout temperature, we take the 
logarithm of the right-hand side of (11.240). The result depends only on the 
order of magnitude of the annihilation cross section. For any interaction of 
electroweak strength, 


Er =Tr/m~ 1/25. (11.241) 


This physical picture suggests a way to estimate the cosmic density 
of WIMP dark matter. We can take as our initial condition the thermal 
density of dark matter at freezeout. We then integrate the Boltzmann equa- 
tion, ignoring the term proportional to na associated with the production 
of WIMP pairs [102]. 

In analyzing the Boltzmann equation, it is useful normalize the particle 
density n of dark matter to the density of entropy s. Since the universe 
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expands very slowly, this expansion is very close to adiabatic. Then entropy 
is conserved, 


ds 
—=-3Hs. 11.242 
hi s ( ) 
In a radiation-dominated universe, s = 2r? g, T3 /45. Now define 
T 
Se. te (11.243) 
s m 


the latter as in (11.241). Using the expression (11.238), we can convert the 
evolution in time to an evolution in temperature or in €. Applying these 
changes of variables and dropping the Tieg term, the Boltmann equation 
(11.239) rearranges to the form 


== Clov)Y? , (11.244) 
where 
_ (Tgx\ 1/2 
C= ( r ) mmp . (11.245) 


Let Yr be the value of Y at € = ép. If we assume that (cv) is approx- 
imately constant, since we are at temperatures close to threshold, it is 
straightforward to integrate this equation to € = 0, corresponding to late 
times. 


Y`! =Y; +Cêpr (ov) . (11.246) 


The second term typically dominates the first. Then we can put back the 
value of C in (11.245) and write the final answer in terms of the ratio of 
the mass density of dark matter to the closure density Qn = nmy/pc. In 
this way, we find 


45\2 1 1 
Og= ( p ) (11.247) 
Pe \T9x Ermp (ov) 


where sọ is the current entropy density of the universe. Turner and Scherrer 
observed that this formula gives a value of Qy that is usually within 10% 
of the result from exact integration of the Boltzmann equation [102]. If 
( sigmav) has a significant dependence on temperature, the derivation is 
still correct with the replacement 


of 
Pies f dé (ov) () (11.248) 


in the denominator of the last term in (11.247). 
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This is a remarkable relation. Almost every factor in this relation is 
known from astrophysical measurements. The left-hand side is given by 
(11.235). On the right-hand side, the entropy density of the universe is 
dominated by the entropy of the microwave background photons and can 
be computed from the microwave background temperature. The closure 
density is known from the measurement of the Hubble constant and the 
observation that the universe is flat. The parameters gą and p are rel- 
atively insensitive to the strength of the annihilation cross section, with 
values gą ~ 100, £p ~ 1/25. The mass of the WIMP does not appear 
explicitly in (11.247). We can then solve for (ov). The result is 


(ov) =1 pb. (11.249) 


This is the value of a typical electroweak cross section at energies of a few 
hundred GeV. If we convert this value to a mass M of an exchanged particle 
using the formula 
Ta? 
(ov) = 8M2 ’ 
the value (11.249) corresponds to M = 100 GeV. 
I consider this a truly remarkable result. From a purely astrophysical 


(11.250) 


argument, relying on quite weak and general assumptions, we arrive at the 
conclusion that there must be new particles at the hundred GeV energy 
scale. It is probably not a concidence that this argument leads us back to 
the mass scale of electroweak symmetry breaking. 

In our study of supersymmetry, we have found an argument from the 
physics of electroweak symmetry breaking that predicts the existence of 
dark matter. As I discussed at the beginning of these lectures, models 
that explain electroweak symmetry breaking are complex. They typically 
involve many new particles. It is easily arranged that the lightest of the 
new particles is neutral. In supersymmetry, there is a reason why the new 
particles are likely to carry a conserved quantum number (11.149). Other 
models of electroweak symmetry breaking, such as the extra dimensional 
and little Higgs models discussed in Section 1.2, have their own reasons 
to have a complex particle spectrum and discrete symmetries. Then these 
models lead in their own ways to WIMPs at the hundred GeV mass scale. 

A slight extension of this argument adds more interest. In supersymme- 
try, the sector of new particles includes particles with QCD color. Since the 
top quark probably plays an essential role in the mechanism of electroweak 
symmetry breaking, it is very likely that, in any model, some of the new 
particles will carry color. If these particles have masses below 1 TeV, they 
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have large (10 pb) pair-production cross sections at the LHC. These parti- 
cles will then decay to the dark matter particle, producting complex events 
with several hard jets and missing transverse momentum. These mild as- 
sumptions thus lead to the conclusion, from any model that follows this 
general line of argument, that we should expect exotic events with multiple 
jets and missing transverse momentum to appear with pb cross sections at 


the LHC. 


11.6.4. Dark Matter Annihilation in the MSSM 


This argument of the previous section gives a very optimistic conclusion 
for the discovery of new physics at the LHC. However, we have already 
discussed that the first observation of supersymmetry or another model of 
new physics will only be the first step in a lengthy experimental program. 
Once we know that superparticles or other new particles exist, we will need 
to study them in detail to learn their detailed interactions and, eventually, 
to work out the underlying Lagrangian that governs their behavior. As we 
have already discussed in Section 3.5 and 4.3, this Lagrangian can give us 
a clue to the nature of the ultimate theory at very short distances. 

The study of dark matter intersects this program in an interesting way. 
In principle, once we have discovered supersymmetric particles, we can try 
to measure their properties and see if these coincide with the properties 
required from astrophysical detections of dark matter. As we have seen in 
Section 5.3, the LHC experiments expect to measure the mass of the LSP 
to about 10% accuracy. These measurements can hopefully be compared 
to mass measurements at the 20% level that can be expected from astro- 
physical dark matter detection experiments [103,104]. We would also wish 
to find out whether the annihilation cross section (ov) that is predicted 
from the supersymmetry parameters measured at colliders agrees with the 
value (11.249) required to predict the observed WIMP relic density. This 
comparison turns out to depend in a complex way on the parameters of the 
underlying supersymmetry theory. 

To begin our discussion of the annihilation cross section, we can make 
a simple model of neutralino annihilation and see how well it works. We 
have seen in Section 4.3 that the right-handed sleptons are often the light- 
est charged particles in the supersymmetry spectrum. Consider, then, an 
idealized parameter set in which the neutralino is a pure bino and pair anni- 
hilation is dominated by the slepton exchange diagrams shown in Fig. 11.31. 
(Away from the pure bino case, there are also s-channel diagrams with Z°, 
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Fig. 11.31. Diagrams giving the simplest scheme of neutralino pair annihilation, 
leading to the annihilation cross section (11.251). 


—> <— —> —> 


Fig. 11.32. Two possible spin configurations for neutralino annihilation: (a.) spin 
0; (b.) spin 1. Because of Fermi statistics, the latter state does not exist in the 
S-wave. 


h?, H?, A°.) In this special limit, the annihilation cross section is given by 


2 2 
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Cw 


1 1 


m2 —t m2—u 
2 £ 


do 
2, 2 
= TA mMm 
dcos 0 N 


v ; (11.251) 


where my is the N; mass. The relative velocity v appears due to the flux 
factor in the cross section; this factor cancels in ov. I have ignored the 
lepton masses. This expression is of the order of (11.250) with M ~ my, 
except for one unfortunate feature: At threshold, t = u and the cross section 
vanishes. This leads to a severe suppression, by a factor of 

4 


, (11.252) 


2 
m 
v2. N 


2 2 
me +m 
7+ ™N 


which is at least of order €/16. So the relic density estimated in this simple 
way is too large by about a factor of 10. 

There is an interesting physics explanation for the vanishing of this 
cross section at threshold [105]. Neutralinos are spin-5 fermions, and we 
might guess from this that, near threshold, they would annihilate in the S- 
wave either in a spin 0 or in a spin 1 state. The two spin configurations are 
shown in Fig. 11.32. However, because the neutralino is a Majorana fermion 
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and therefore its own antiparticle, an S-wave state of two neutralinos must 
be antisymmetric in spin. Hence, the spin 1 S-wave state does not exist 
However, as we know from pion decay, a spin 0 state can convert to a pair 
of light leptons only with a helicity flip. Thus, there is an annihilation cross 
section from the spin 0 S-wave only when lepton masses are included, and 
even then with the suppression factor m? /m%,, which is 1074 even for 7+ 7~ 
final states. 

To obtain a realistic value for the neutralino relic density, we have to 
bring in more complicated mechanisms of neutralino annihilation. These 
mechanisms are not difficult to find in various regions of the large super- 
symmetry paramet er space [106-108]. We need to look for annihilation 
processes that can proceed in the S-wave with full strength. Three possible 
mechanisms are shown in Fig. 11.33. 


N N 


Fig. 11.33. Three mechanisms for obtaining a sufficiently large annihilation cross 
section to give the observed density of neutralino dark matter: (a.) gaugino- 
Higgsino mixing, opening the annihilation channels to WWT and Z°Z°, (b.) 
resonance annihilation through the Higgs boson A®, (c.) co-annihilation with 
another supersymmetric particle, here taken to be a l. 


Pairs of neutralinos can annihilate in the S-wave into vector bosons. 
The bino does not couple to W or Z pairs, but if the lightest neutralino has 
Higgsino or wino content, this reaction can be important. For charginos 
of mass about 200 GeV, this annihilation cross section can be 50 pb for a 
pure wino or Higgsino, so only a modest content of these states is needed 
to give a cross section of 1 pb. 

The s-channel exchange of a Higgs boson can provide a mechanism for 
neutralino annihilation in the spin 0 S-wave. Because this state is CP-odd, 
it is the boson A? that is relevant here. If m4 is close to the neutralino 
threshold 2my, the cross section has a resonant enhancement. Note that 
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the Ñ; annihilation vertex to A arises as a Higgs-Higgsino-gaugino Yukawa 
term, so this vertex is nonzero only if Ñ, has both gaugino and Higgsino 
content. If ma = 2my, the resonance enhancement is at full strength and 
the cross section can be as large as 50 pb. Thus, it is A boson masses 
about 20 GeV above or below the threshold that give the desired cross 
section (11.249). 

The final mechanism shown in the figure is coannihilation. As we have 
discussed, the freezeout of the MN occurs at a temperature given by T/my ~ 
1/25. So if there is another particle in the supersymmetry spectrum that 
is within 4% of the N; mass, this state will have a number density that 
remains in equilibrium with the number density of the Ni. If this particle 
has S-wave annihilation reactions, those reactions can be the dominant 
mechanisms for the annihilation of supersymmetric particles. For a light 
slepton, the reactions 


OAN ae CAO (11.253) 


can give significant S-wave annihilation. In [106,109], the lighter stau is 
invoked as the coannihilating particle. In [110], the lighter top squark is 
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Fig. 11.34. Scatter plot of SUSY parameter points consistent with data from the 
LHC in the analysis of the parameter set LCC2 from [111]. The horizontal axis 
show the value of Qn at each parameter point. The vertical axes show polarized- 
beam cross sections measurable at the ILC, in fb: (a.) o(ezef > Cor: 
(b.) o(egef > NSN). The colored bands show the +1ø region allowed after 


the ILC cross section measurements. 
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invoked as the coannihilating state. If the lightest neutralinos and charginos 
are Higgsino-like, chargino coannihilation can also be important. 

It is, then, a complex matter to predict the neutralino relic density 
from microscopic physics. We will first need to learn what particles in the 
supersymmetry spectrum play the dominant role as particle exchanged in 
annihilation reactions or as coannihilating species. We will then need to 
measure the couplings and mixing angles of the important particles, since 
the dominant annihilation diagrams depend sensitively on these. 

Some examples of how measurements at the LHC and ILC can accu- 
mulate the relevant information are described in [111]. Figure 11.34 shows 
a part of the analysis of this paper for a particular SUSY model in which 
the dominant annihilation reactions are N,N, ~ WtW~-,Z°Z°. As a first 
step, the authors constructed numerous supersymmetry parameter sets that 
were consistent with the mass spectrum of this model as it would be mea- 
sured at the LHC. These parameter sets included a variety of models in 


LHC+ILC— 1000 
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Fig. 11.35. Summary plot for the prediction of Qy from collider data for the SUSY 
parameter set LCC2 considered in [111]. The three curves show the likelihood 
distributions for the prediction of Qn using data from the LHC, the ILC at 
500 GeV, and the ILC at 1000 GeV. 
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which the LSP was dominantly bino and wino. The figure shows scat- 
ter plots of the predictions of these models with ILC cross sections for 
neutralino and chargino pair production on the vertical axis and Qy on 
the horizontal axis. The two cross sections clearly separate the bino- and 
wino-like solutions. The second of these cross sections is the polarized reac- 
tion of chargino pair production for which the cross section is displayed in 
Fig. 11.19. The horizontal lines represent the accuracy of the measurements 
of these cross sections expected at the ILC. These measurements select the 
bino solution and also play an important role in fixing the bino-Higgsino 
mixing angle which is a crucial input to the annihilation cross sections. In 
Fig. 11.35, I show the distribution of predictions for Qy expected for this 
model, in the analysis of [111], from the data on SUSY particles that would 
be obtained from the LHC, from the ILC at a center-of-mass energy of 500 
GeV, and from the ILC at a center-of-mass energy of 1000 GeV. 

The similar summary plot for another of the models considered in [111] 
is shown in Fig. 11.36. The model considered in this analysis is one in 
which the neutralino relic density is set by stau coannihilation. In this 
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Fig. 11.36. Summary plot for the prediction of Qn» from collider data for the 
SUSY parameter set LCC3 of [111]. The notation is as in Fig. 11.35. 
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model, the stau would be discovered at the LHC, and the stau-neutralino 
mass difference would be measured to about 10% accuracy at the 500 GeV 
ILC. However, the annihilation reactions also depend on mixing angles and 
on the value of tan 8. In this scenario, these are determined only by ILC 
measurements of some of the heavier states of the SUSY spectrum. 

Collider measurements of the SUSY spectrum can also be used to con- 
strain cross sections of the WIMP that are important for experiments that 
seek to detect dark matter, for example, the neutralino-proton cross sec- 
tion and the cross section for neutralino pair annihilation to gamma rays. 
If we can accurately predict these cross sections from collider data, the in- 
formation about the SUSY spectrum that we learn from colliders will feed 
back into the astrophysics of dark matter. Some numerical examples that 
illustrate this are presented in [111]. 


11.7. Conclusions 


In these lectures, I have given an overview of supersymmetry and its appli- 
cation to elementary particle physics. In the early sections of this review, I 
presented the formalism of SUSY and explained the rules for constructing 
supersymmetric Lagrangians. Our discussion then became more concrete, 
focusing on the mass spectrum of the MSSM and the properties of the 
particle states of the MSSM spectrum. This led us to a discussion of the 
experimental probes of this spectrum and the possibility of measurement 
of the parameters of the supersymmetric Lagrangian. 

This possibility is now coming very near. As I have discussed in the last 
sections of this review, supersymmetry gives concrete answers to the major 
questions about elementary particle physics that we expect to be addressed 
at the hundred GeV scale—the questions of the origin of electroweak sym- 
metry breaking and the identity of cosmic dark matter. In the next year, 
the LHC will begin to explore the physics of this mass scale. Supersym- 
metry is one candidate for what will be found. I hope that, after studying 
these lectures, you will agree that the picture provided by supersymmetry 
is highly plausible and even compelling. 

Whatever explanations we will learn from the LHC data, our investiga- 
tion of it will follow the general paradigm that I have described here. In 
successive stages, we will use data from the LHC and the ILC to learn the 
mass spectrum of new particles that are revealed at the LHC, to determine 
their quantum numbers and couplings, and to reconstruct their underlying 
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Lagrangian. On the basis of the detailed studies of this program that have 
been carried out for the MSSM, we have the expectation that we will be able 
to learn the underlying theory of the new particles and to test the specific 
explanations that this theory gives for the mysteries of the fundamental 
interactions. 

Is supersymmetry just an attractive theory, or is it a part of the true 
description of elementary particles? We are about to find out. 
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